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Abstract 



Search trees are fundamental data structures in computer science. We study functionals on 
random search trees that satisfy recurrence relations of a simple additive form. Many impor- 
tant functionals including the space requirement, internal path length, and the so-called shape 
functional fall under this framework. Our goal is to derive asymptotics of moments and identify 
limiting distributions of these functionals under two commonly studied probability models — the 
random permutation model and the uniform model. 

For the random permutation model, our approach is based on establishing transfer theorems 
that link the order of growth of the input into a particular (deterministic) recurrence to the order 
of growth of the output. For the uniform model, our approach is based on the complex-analytic 
tool of singularity analysis. To facilitate a systematic analysis of these additive functionals we 
extend singularity analysis, a class of methods by which one can translate on a term-by-term 
basis an asymptotic expansion of a functional around its dominant singularity into a correspond- 
ing expansion for the Taylor coefficients of the function. The most important extension is the 
determination of how singularities are composed under the operation of Hadamard product of 
analytic power series. 

The transfer theorems derived are used in conjunction with the method of moments to 
establish limit laws for m-ary search trees under the random permutation model. For the 
uniform model on binary search trees, the extended singularity analysis toolkit is employed 
to establish the asymptotic behavior of the moments of a wide class of functionals. These 
asymptotics are used, again in conjunction with the method of moments, to derive limit laws. 

Advisor: James Allen Fill 

Readers: James Allen Fill and John C. Wierman 
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The time has come," the Walrus said, 
To talk of many things: ..." 

The Walrus and The Carpenter 
Lewis Carroll 
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Chapter 1 



Introduction 

1.1 Background 

We start by giving a brief overview of search trees, the basic object under scrutiny in this 
dissertation. Search trees are fundamental data structures in computer science, their primary 
appHcations being in searching and sorting. For integer m > 2, the m-ary search tree, or 
multiway tree , ge nerahzes the binary search tree. The quantity m is called the branching factor. 
According to 1491 , search trees of branching factors higher than 2 were first suggested by Muntz 
and Uzgalis |56| "to solve internal mernory problems with large quantities of data." For more 
background we refer the reader to 0j Q| and ■ 

An m-ary tree is a rooted tree with at most m "children" for each node (vertex), each child 
of a node being distinguished as one of m possible types. Recursively expressed, an m-ary tree 
either is empty or consists of a distinguished node (called the root) together with an ordered 
m-tuple of subtrees, each of which is an m-ary tree. 

An m-ary search tree is an m-ary tree in which each node has the capacity to contain m — 1 
elements of some linearly ordered set, called the set of keys. In typical implementations of m-ary 
search trees, the keys at each node are stored in increasing order and at each node one has m 
pointers to the subtrees. By spreading the input data in m directions instead of only 2, as is the 
case for a binary search tree, one seeks to have shorter path lengths and thus quicker searches. 

We consider the space of m-ary search trees on n keys, and assume that the keys are linearly 
ordered. Hence, without loss of generality, we can take the set of keys to be [n] := {1,2,..., n}. 
We construct an m-ary search tree from a sequence s of n distinct keys in the following way: 

1. If n < m, then all the keys are stored in the root node in increasing order. 

2. If n > m, then the first m— 1 keys in the sequence are stored in the root in increasing order, 
and the remaining n— (m— 1) keys are stored in the subtrees subject to the condition that 
if (Ji < (72 < • ■ ■ < cfm-i denotes the ordered sequence of keys in the root, then the keys 
in the jth subtree are those that lie between aj-i and aj, where ctq := and am := n-\-l, 
sequenced as in s. 

3. All the subtrees are m-ary search trees that satisfy conditions 1, 2, and 3. 
Figure IT. 1 . II shows the quaternary tree associated with the sequence 

s (10,7,12,4,1,8,5,6,9,14,11,2,15,13,3). 

We may also build an m-ary search tree by inserting the successive elements of a sequence 
into an initially empty tree. This process is illustrated for the sequence s in Figure [1.1.21 

It is our goal to study functionals defined on m-ary search trees under two natural probability 
models. The functionals we consider are of additive type, each induced by a "toll function." 
Informally speaking, the value of an additive functional on an m-ary search tree is recursively 
defined as the sum of the functional's value on the subtrees rooted at the children of the root 
of the tree and a term (the toll) that is a function of the size of the tree. Additive functionals 



2 



1.1. Backirrouiid 
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Figure 1.1.1: The quaternary (i.e., "4-ary") tree associated with the sequence s. 





Figure 1.1.2: Sequential construction of the quaternary search tree from the sequence s. 
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1.2. Probability models 



defined in this manner represent the cost of divide- and- conquer algorithms, where the inherent 
recursive nature of the algorithms lends itself naturally to such a formulation. 



1.2 Probability models 

The two most common probability models on the space of m-ary search trees are the uniform 
model and the random permutation, or random insertion, model. In the case m = 2, the number 
of search trees on n keys is enumerated by the nth Catalan number. Thus the uniform model 
on binary search trees is also referred to as the Catalan model. 

Under the uniform model, the probability of obtaining any particular tree is just the recipro- 
cal of the number of m-ary search trees on n keys. Fill and Dobrow have studied problems 
associated with determining this number. Fill ,21.] argued that the functional corresponding to 
the toll sequence (lnn)„>i serves as a crude measure of the "shape" of a binary search tree, 
and explained how this functional arises in connection with the move-to-root self-organizing 
scheme for dynamic maintenance of binary search trees. He obtained asymptotic information 
about the mean and variance under the Catalan model. (The latter results were rederived in 
the extension IHBl from binary search trees to simply generated rooted trees.) 

In Prop. 2] Flajolet and Steyaert gave order-of-growth information about the mean of 
functionals induced by tolls of the form n" . Takacs established the limiting (Airy) distribution 
of path length in Catalan trees [toI - It^ . which is the additive functional for the toll n — 1. The 
additive functional for the toll arises in the study of the Wiener index of the tree and has 
been analyzed by Janson [4ll |. 

The uniform model on binary search trees has been also been used recently by Janson 
in the analysis of an algorithm of Koda and Ruskey ji^l for listing ideals in a forest poset. 

Now we turn our attention to the random permutation model. Let tt be a random permuta- 
tion of [n]: each of the n! permutations is equally likely to be the value of tt. The distribution 
of trees under the random permutation model is the distribution induced by constructing an 
m-ary search tree from tt in the manner we have described in Section 11.11 For example, for 
m — 3 and n = 4, there are 6 distinct trees each occurring with probability 1/6, as depicted 
in Figure [T. 2. II We note that it is not the case in general that all trees are equiprobable. For 
example, if m = 2 and n ~ 3, there are 5 distinct trees; 4 of these have probability 1/6 each, 
and 1 (corresponding to permutations 213 and 231) has probability 2/6. 

Unlike under the uniform model, functionals of search trees under the random permutation 
model have been studied a great deal, the most attention being paid to binary search trees. 
Indeed, much of the motivation for studying additive functionals comes from the fact that the 
number of key comparisons in Quicksort is such a functional and under the random permutation 
model it corresponds to the cost of Quicksort on uniformly rando m inp ut. There is an extensive 
Uterature on the analysis of Quicksort; see, for example, |1 [11 illiJIil S 111 IM El El El 

functio nals under the random permutation 
for treatment of functionals defined 
i^- Chapter 3 of 0| is a rich 
source of results on the properties of m-ary search trees. 




1.3 Roadmap 

In Chapter 121 we will formally define the problem of additive tree functionals, introducing the 
concept of a toll function. The recurrences obtained will be translated to relations among 
generating functions. The resulting relations will serve as a motivation for our main theoretical 
results — transfer theorems and the extension of the singularity analysis toolkit. In ChapterOwe 
consider the random permutation model and prove general results that relate the asymptotics 
of the toll function to that of the corresponding tree functional. We will then proceed to extend 
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1.3. Roadmap 
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Figure 1.2.1: The distinct ternary trees and the permutations that generate them, for n — A. 



singularity analysis in Chapter 0] so that it is readily apphcable to recurrences obtained under 
the uniform model. 

Next we will consider various applications. In Chapter [S] we will combine the use of the 
extended singularity analysis toolkit and the transfer theorems derived in Chapter |2| to derive a 
complete asymptotic expansion of the mean of the shape functional of an m-ary search tree under 
the random permutation model and the variance for binary search trees. Then in Chapter we 
will use the transfer theorems of Chapter |31 to determine limiting distributions of additive tree 
functionals under the random permutation model. Finally, in Chapters [3 and |H1 the extension 
of singularity analysis to Hadamard products will be applied to the Catalan model for binary 
search trees, resulting in limit laws under two different classes of toll functions. 
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Chapter 2 



Additive tree functionals 

We now formally introduce additive functionals on m-ary search trees. We will provide some 
motivating examples and derive the basic recurrence that governs the behavior of these functions 
under each of the random permutation model and the uniform model. 

2.1 Notation and definitions 

We first establish some notation. Let T be an m-ary search tree. We use |T| to denote the 
number of keys in T. Call a node full if it contains to — 1 keys. For 1 < j < m, let Lj{T) 
denote the jth subtree pendant from the root of T. [The numbering of Li{T), . . . , Lm{T) will 
correspond to left-to- right ordering in our figures.] When it is clear to which tree T we are 
referring we will denote the subtrees simply by Li, . . . , Lm- For x a node in T, write for the 
subtree of T consisting of x and its descendants, with x as root. This notation is illustrated in 
Figure ETU 




Figure 2.1.1: Example of notation for the quaternary tree T of Figures Fl . 1 . II and II . 1 . 21 In this 
example, |T| = 15 and ~ Li. 



Definition 2.1.1. Fix to > 2. We will call a functional / on m-ary search trees an additive 
tree functional if it satisfies the recurrence 

m 

/(r) = ^/(i.(r)) + vi, (2.1.1) 

i=l 

for any tree T with |r| > to — 1. Here (in)n>m-i is a given sequence, henceforth called the toll 
function. 

Note that the recurrence H2.1.1|l does not make any reference to <n for < n < m — 2 nor 
specify /(T) for < |T| < to - 2. 
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2.2. Examples 



2.2 Examples 

Several interesting examples can be cast as additive functionals. 

Example 2.2.1. If we specify f{T) arbitrarily for < \T\ < m— 2 and take i„ = c for n > to— 1, 
we obtain the "additive functional" framework of |49l §3.1]. (Our definition of an additive 
functional generalizes this notion.) In particular if we define /(0) :— and f{T) :— 1 for the 
unique TO-ary search tree T on n keys for 1 < n < m — 2 and let i„ = 1 for n > to — 1, then 
f(T) counts the number of nodes in T and thus gives the space requirement functional discussed 
in [li §3.4]. 

Example 2.2.2. If we define f{T) := when < |T| < to - 2 and i„ := n - (to - 1) for 
n > TO — 1 then / is the internal path length functional discussed in [i^ §3.5]: f{T) is the sum 
of all root-to-key distances in T. 

Example 2.2.3. As noted in Chapter^ we can associate with each permutation of [n] an m-ary 
search tree. Let Q denote the probability mass function of the distribution of trees induced by 
associating an TO-ary search tree with each of the n! (equally likely) permutations of [n]. Dobrow 
and Fill [l3 noted that 

QiT) = (2.2.1) 



where the product in H2.2.1I) is over all full nodes in T. This functional is sometimes called the 
shape functional as it serves as a crude measure of the "shape" of the tree, with "full" trees 
(like the complete tree) achieving larger values of Q. For further discussions along these lines, 
consult (la] and 0. If we define /(T) for < |T| < to - 2 and i„ := ln(„"i) for 
n > TO — 1, then /(T) = — \nQ{T). In this work we will refer to — InQ (rather than Q) as the 
shape functional. 



2.3 A common framework for all moments 

Our aim is to study the distribution of the functional /(T) satisfying 1)2.1.1(1 under the two 
aforementioned probability models. Under the uniform model T is distributed uniformly over 
all TO-ary search trees with n keys, while under the random permutation model T has the 
distribution Q defined at 1)2.2.1(1 . In either case we will use X„ to denote a random variable 
having the distribution of /(T) and define /i„(fc) :— EX^ for fc > 1, the choice of model being 
clear from context. To address the problem recursively we examine the joint distribution of the 
subtree sizes \Li\, . . . , \Ljn\- 



Random permutation model 

Under the random permutation model the joint distribution of the subtree sizes . . . , \ Lm. \ 
is uniform over all j^) TO-tuples of nonnegative integers that sum to n — (to — 1): see |4fl. 
Exercise 3.8]. We now apply the law of total expectation by conditioning on (|Li|, . . . , |im|)- 
Let denote the sum over all m-tuples (ji, . . . ,jm) that sum to n — (to — 1) and the sum 
over all (to -|- l)-tuples (fci, . . . , km+i) of nonnegative integers that sum to k. Then, letting ® 
denote sums of mutually independent random variables, for n > to — 1 we have 

^ln{k) = EE (X^ I . . . , |L„,|) = ^ E (X,, ® • • • e x,^ + t^f 

= r^^^L k k }j^'n{kl)■■■HAk..)ti-^'. 

\m-l) j k V^l' ■ ■ ■ ' ''■"1+1/ 
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2.3. A common framework for all moments 



We can rewrite this equation as 



ri-(m-l) , _ 

Mn(fc) = 7^ E r (2.3.1) 

\m-l) j=0 ^ ^ / 

where 

rn{k):^J2* (. I . ]t-^'J^Y.^'nikl)■■■Hr.[km), (2.3.2) 

with denoting the same sum as with the additional restriction that ki < k for i = 
l,...,m. We have thus established that the moments /x„(fc) each satisfy the same basic re- 
currence in n, differing as k varies only in the non- homogeneous term rn{k). Observe that 
?'n(l) = tn, the toll function. We record this important fact as 

Proposition 2.3.1. Under the random permutation model all moments of an additive functional 
satisfy the basic recurrence 

n— (m— 1) , 

m \ ^ I n — 1 — ] 



an = hn + Y-^ ( m_2 )"^' n>m-l, (2.3.3) 

with specified initial conditions (say) aj '■= bj, < j < m ^ 2. [Recall the statement following 
Definition \2.1.l\ about the initial conditions for the recurrence (|2.1.1|l ./ 

In order to analyze the recurrence relation H2.3.3() we introduce generating functions 
A{z) :— E a„z" and B{z) ^ 

Furthermore, let 

^:=n(^+^) 



n=0 n=0 



r-1 

denote the rth rising factorial power of x and 

r-l 



fc=0 



the rth falling factorial power of x. Multiplying l|2.3.3(l by n ™ ^ z" and summing over 

n > m — 1 we get the differential equation 

^ B^"'-^\z) + to!(1 - z)-("-iU(z). (2.3.4) 

Equations of the form H2.3.4|) are members of a class known as Euler differential equations. 
Using the method of variation of parameters and combinatorial identities one can obtain the 
general solution to this equation. We only state the results here, referring the interested reader 
to Appendix fXl 

Theorem 2.3.2. Let A and B denote the respective ordinary generating functions of the se- 
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2.3. A common framework for all moments 



quences (an) and (6„) in Provosition \2. 3. 71 Then 

A{z) = ^^(1 - ^)^'^' + E / s^""'HC)(i - 0'^+"-' dC, (2.3.5) 

where ip is the indicial polynomial 

V'(A) := A"^ - m! = A(A + 1) • • • (A + m - 2) - to! (2.3.6) 

with roots 2 Ai, A2, . • . , Kn-i in nonincreasing order of real parts. In (I2.3.5|l . the coefficients 
Ci, C2, . . . , Cm-i can he written explicitly in terms of the initial conditions bo, ... , 6^-2 as 

Proof. See Appendix 1X1 in particular Corollary I A. 31 and Proposition 

The indicial polynomial H2.3.()l) is well studied; see |^ and Appendix We will 

exploit the expression 12.3.5() for A{z) to relate the asymptotic properties of the sequence (6„) 
to those of (a„) in ChapterOand then, in ChapterEl use the transfer theorems of ChapterOto 
derive limiting distributions of additive functionals. 



Uniform model for binary trees 

We now turn our attention to the uniform model for binary search trees. The techniques we 
develop in this dissertation can be extended to handle the uniform model on m-ary search trees 
as well, although much of that analysis will involve singularity analysis of implicitly defined 
functions. Our techniques can also be extended to handle any simply generated family of 
trees [sllegj. 

It is well known that the number of binary trees on n nodes is counted by the 71th Catalan 
number (see, e.g., Exercise 6.1.19c]) 

n + 1 \ n 
with generating function 

00 

CAT(z) Z^"-^" Vl^). (2.3.8) 

n=0 

Let fin{k) be defined as the fcth moment of the random variable Al„ and /2n(fc) := /3„^„(fc)/4". 
By conditioning on the key stored at the root, noting that the probability is /3i-i/3n-i/Pn that 
the key stored at the root is /, 1 < ^ < n, and multiplying throughout by /3n/4" we get, for 
n > 1, 

1 " f{ 

Mk) = 2 E Sr^Ai^-i(fc) + ^""(^)' (2-3-9) 
3=1 

where 

^n{k):^- J2 U i. t ^«'E'^^-i(^i)^"-^(^2). (2.3.10) 

ki+k2 + k3=k ^ ' ' / j = l 
ki,k2<k 
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Again we observe that moments of all order satisfy the same basic recurrence, with 
We define generating functions 

oo oo oo 

Mfe(^) :=^/i„(fc)z", := ^f„(fc)z", T{z) -.^ J^*^^" ■ (2-3-11) 

n— n— n— 

Then, usiiiff ipm^ and ifTT^ . we get 

M,(z) = -^, (2.3.12) 

so to analyze Mk{z) it is sufficient to analyze Rk{z). With this in mind we introduce the 
Hadamard product of two power series. 

Definition 2.3.3. The Hadamard product of two power series F and G, denoted by F{z)qG(z) 
or {F G){z), is the power series defined by 

F{z)QGiz) :=^/„5„z", 

n 

where 

F(z) = Z"^" and G(z) = 5].9„z". 

n n 

In light of this definition, the defining relation for Rk{z) at H2.3.10|l leads to the following 
relation in terms of generating functions: 

Rk{z)= J2 .\)™)°'^0[|m,,(z)M,,(z)], (2.3.13) 

ki+k2+k3 = k ^ 1' 2' 

ki,k2<k 

where, for k a nonnegative integer, 

\Qk 



T{zy^^ ■.= T{z)Q---QT{z). 



Our approach for getting asymptotic expansions for moments of all orders relies on singularity 
analysis of generating functions jsj] , a tool that, under suitable conditions, allows the translation 
of asymptotics of generating functions to asymptotics of the underlying sequences. In order to 
analyze Mk{z) and Rk{z) inductively (in k), we would like to be able to predict how singularities 
are composed under Hadamard products. This is precisely the problem studied in Chapter 01 
We will, in Chapters [3 and |H1 use the results derived in Chapter 01 to get limiting distributions 
induced by two types of toll functions. 
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Part II 
Theory 



Chapter 3 

Random permutation model: transfer theorems 

In Proposition 12 . 3 . Tl we have seen that under the random permutation model, moments of each 
specified order for any additive functional on m-ary search trees satisfy a recurrence relation of 
the form 

an^K + , n X ^ i TO-2 ]"^' ri>m-\, (3.0.1) 

with specified initial conditions aj := bj for j = 0, ... ,m — 2. By Theorem 12.3.21 the output 
generating function 



oo 
n=0 



A{z) :=^a 

n— 

is given in terms of the input generating function 

oo 

by 



m— 1 m— 1 ,^ \ — \- fZ 

A{z) = ^^(1 - ^y^' + E ^TYT^ / S^'"~'^(C)(i - 0^'+'"-^ dC, 



where il) is the indicial polynomial 



with roots 2 =: Ai, A2, . . . , Am_i listed in nonincreasing order of real parts. We refer the reader 
to Appendix IbI for relevant facts about this polynomial, and recall from there the notation 

a max 3fJ(Aj). 

2<j<m— 1 

The coefficients ci, C2, . . . , c„i-i can be written explicitly in terms of the initial conditions 
6o,...,6m-2- In particular, by Proposition IA.41 with j = 1 and 5c'^''(0) = k\bk there, we get 
[also using IjB.Sp ] 

1 b 

« = «^ g (JTTWT2)- '"-^i 

Here Hm is the mth Harmonic number: Hm = SjLi J- 

In this chapter we will obtain results of both exact and asymptotic natures that link the 
behavior of the output sequence (o„) to that of the input sequence in 1)3.0.1(1 . 

Note. In the sequel whenever multi-valued functions are encountered, we will assume that 
we follow the principal branch. 
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3.1. The Exact Transfer Theorem 



3.1 The Exact Transfer Theorem 

Our main result repeats (|2.3.5|l and then gives an alternative exact transfer from B to A: 
Theorem 3.1.1 (Exact Transfer Theorem (ETT)). Let 

m—2 oo 

B{z) := B{z) - J2 ^J"^^ = J2 (^•^•1) 

j — n—m— 1 



Then 



A{z) = ^^-(1 - ^y^' + E ^TTTT^ / B^^-^\Q){1 - C)^.+"-2 dC (3.1.2) 

m—l m — l \ — \ 

= ^c,(l-.)-^^■+B(z) + m!^^iA_ / B(C)(l-C)'^^'rfC. (3.1.3) 

Proof. We begin with H3.1.2() , repeated from Theorem 12.3.21 and note that B there can be 
replaced by B. We then use repeated integration by parts and invoke Identitv lB.3l Denoting 



A{z) :=A(z)-^c,(l 



z] 



-A, 



after m — 2 integrations by parts we find 

^(^) = E ^,(^ (^.- + - 2) • • • (A, + 1) y i?'(C)(l - C)'^- dC- 



But 

(Aj + m - 2) . . . (Aj- + 1) = 

so 



A™ + m\ to! 

Aj Aj Xj 



^ AjV"(Aj) Jo 

We obtain H3. 1.3(1 by performing one more integration by parts and utilizing Idcntitv IB . II with 
A = 0. □ 

Remark 3.1.2. For computations, (|3.1.2|l might be easier to use when it is no bother to compute 
derivatives of B; otherwise, (I3.1.3|l is easier. Equation H3.1.3() will be used in establishing part (a) 
of the Asymptotic Transfer Theorem in Section the proof of part (b) will use 1(3. 1.2(1 . 



3.2 The Asymptotic Transfer Theorem 

It is quite easy to transfer asymptotics for B to asymptotics for A using the ETT. We give three 
examples important for applications to moments of additive functionals in the next theorem, 
proved independently in 8J| using a quite different approach. 

Theorem 3.2.1 (Asymptotic Transfer Theorem (ATT)). Suppose that (a„) and (6„) 
satisfy the recurrence ((3.0.1(1 . 
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3.2. The Asymptotic Transfer Theorem 



N If 



bn = o{n) and 7 — ^ ^ w — ^ 2) ^'^™^'''9^^ (conditionally), (3.2.1) 



then 



Ki b, 



Hm — 



—n + o(n), where Ki :— y -4— 

1 ^ ^ ^ .7 + 1 .7 



^.T-oO- + l)(j+2) 



(3.2.2) 



(b) If bn = K2{n + 1) + hn where h„ satisfies (|3.2.1|l [with (bn) replaced by (hn)], then 



Htti, 1 Hm. 1 



where 



^(.7 + 1) (.7 



^t^(j + l)0- + 2) 



H„, - 1 



- 1 



hIS> 1 

2{Hm - 1) 



(c) Ifbn = K^n-" + o(7i^(^)) with 3?(w) > 1, then 



a - ^ + oin^^"'^) 

r(ii+m) 



(3.2.3) 



(3.2.4) 



(3.2.5) 



Before giving the proof we estabHsh a few lemmas. The first result is found in the proof 
of Lemma 6 of 8]. For completeness we include a proof. Here, as throughout, [z"]F(z) is the 
coefficient of z" in the series expansion of F. 

Lemma 3.2.2. Let Y{z) denote the ordinary generating Junction of the sequence (yn) with 
yo — 0- For any A G C, 



Proof. The function 



/ (i-c)-^nc)rfcUE^ n (1 



A - 1 



fc=0 j=k+2 



W^(z) := (1 - z)--^ [l-Qf-'YiOdC 



is the unique solution with M^(0) = to the differential equation 

W'{z) = A(l - z)-^W{z) + (1 - z)-^Y{z) 
that is, Wn '■— [z'^]W{z) for n > satisfies wq — Q and 



^ n— 1 -j^ n — 1 

-y^Wk + -'S^Vk, n>l. 



n '■ — ' 71 



n > 0. (3.2.6) 



(3.2.7) 



To solve the recurrence (|3.2.7f) we compute nw„ — (n — \')Wn-\ — Xwn-i + 27n-i so that 



w„= [1 



A- 1 



Wn-l + -Vn^l, n > 1. 

n 



Now we iterate to get the result. 



□ 
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3.2. The Asymptotic Transfer Theorem 



The product in (|3.2.6|l may be written (when A G C \ {0, —1, —2, . . .}) as 

r(A + n)r(2 + fc) 

r(i + n)r(A + fc + i)' 

which by StirUng's formula equals 



(fc + l)^-i[l + 0((fc + l)-i) 



(3.2.. 



for n > 1 and fc > 1. [The product in equals even if A e {0, -1, -2, . . .}. ] Also, 

of special interest is the case A = 2, in which case (|3.2.6|l reduces to 

a-cmc)^c)-(n+i)E ^,^^^^,^,) , n>o. (3.2.9) 

For part (a) of the ATT we use the following estimates from |8j. These follow readily from 
Lemma E2I21 (EZHl), and 

Lemma 3.2.3. (a) Ifdi{X) < 2 and 

OO 

n=0 

satisfies tjq = and ?/„ = o{n), then 

(^(1 - z)-' y(C)(i - C)'-' dc) = o{n). 

(h) With B defined at if (|XTT|l holds, then 

For part (c) of the ATT, we will need the following simple comparison lemma. 

Lemma 3.2.4. If {b„) and (fe^J are two toll sequences such that 

\bn\<bn foralln>0, 

then the corresponding output sequences (a„) and (aj^) in (|3.0.1|l (with the initial conditions 
stated there) satisfy 

1^11 1 < o.'n fo''' "^^^ n > 0. 
Proof. This follows immediately by induction. □ 

Proof of Theorem Unii Part (a) follows immediately from (pTTT|l . LemmaEZSl ^E^, 

and 

For part (b) suppose first that 6„ = ?i + 1. Then -B(z) = (1 — z)^^, so 

B(™-i)(z) = - 
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3.2. The Asymptotic Transfer Theorem 



Plugging this into (|3.1.2|l we find 



a-n = [n + 1) 



Cl + TO, 



E 

J=2 



1 



Recall Identitv lB.4l and since 5„ = n + 1, we have in this case by Ij^. 0.2(1 . that ci = 1. Therefore 



(n + 1) 1 + - 



-nHn 



rr{2) _ , 



If 
1 



(ij„ 



- 1 



H„ 

(2) 



-[(n + l)i7„-n] + o(7i) 



1 2{Hm - 1) 



n + o(n). 



This completes the proof of part (b) for our special case, and the general case follows from this 
and part (a) using the superposition principle. 

Finally for part (c) suppose first that 6„ = {v + l)"/n! ~ /r{v + 1), so that B{z) ~ 
and = + Plugging this into ^^T^ and utilizing 

Identity IB. II with X = v + 1 and the calculation 



[v + 1) 



m—1 



we find 



A{z) = 



{v + - m 

mW{v + 1) 



TO!r(D + 1) 



1 - 



T{v + m) 



(l-z)-(^+l)+0(|l-z|-2). 



T{v + to) 

By singularity analysis (see Theorem 14. 1.2|l . this completes the proof of part (c) for our special 
case. 

To complete the proof (by superposition) in the general case, we need only show that if 
bn = o(n") for real v > 1, then a„ = 0(71"). Indeed, fix e > 0; then there exists a sequence 
(&^) such that \bn\ < b',^ for all n and b'^^ = e{v + l)"/n! for all large n. This toll sequence is 
but a slight modification of our special-case toll sequence, and we see that its output generating 
function has coefficients 



e'n" + o(n'"), where e' 



r(z; + 1) 



TO!r(i; + 1) 



T{v + to) 



Now Lemma [3.2.41 implies that 

limsup„|a„|n~" < e'; 
since e (and hence e') can be made arbitrarily small, this completes the proof. 



□ 



The conditions ((3.2. l|l on 6„ are not only sufficient but also necessary for asymptotic linearity 
of a„. Indeed, here is a converse: 

Proposition 3.2.5. //a„ — Kn + o(n) for some constant K, then ((3.2.1() holds. 

Proof. Proposition 7 of Q provides the simple proof that 5„ — o{n). Moreover, from ((3.1.3(1 . 
the value of Ci at 1(3.0.2(1 . and 1(3.2.9(1 with Y taken to be B, we find that 



E 

3=0 



(j + l)(j + 2) 



= if(i/„-l) + o(l). 
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3.2. The Asymptotic Transfer Theorem 



i.e., the series 

\ ^ o„ 

converges [conditionally, to K{Hm — I)]- D 

The following additional asymptotic transfer results are established by calculations similar 
to those in the proof of the ATT. 

Definition 3.2.6. A nonnegative function L{n) defined for n > no > and not identically zero 
is called slowly varying if 

Lin) 

-—, ^1 as n (X) 

L{\tn\) 

for every t > Q. If ng > 0, we define L{n) = for < n < riQ. 

Examples of slowly varying functions include any powers of Inn or In Inn. The key property 
of slowly varying functions that we employ is the regular variation fact 0, 1.5.9a], that 



,k<r 



Theorem 3.2.7. Consider the initial value problem Ij'i. 3.3(1 . 

(a) // 2 < m < 26 and 6„ = o{'\/n), then we can refine 1(3. 2. 2() to 

«" = Tj^^ i " + (3.2.11) 

n-m — t 

(h) //3?(A2) —: a < \ + (3 and bn ~ n^Lin) with L slowly varying, then we can refine ((3.2.2|l 
to 



Hm-l m!-(l + /3)™-i 
(c) If hn ^ nL(n) with L slowly varying, then, with Ki defined at (|3.2.2|l . 

m 



(3.2.12) 




*/ 2^ < oo, 



oo. 



(d) Part (c) of the ATT can he extended as follows. If 6„ = K^n^Lin) + o(n*(")^(")) with 
^{v) > 1 and L slowly varying, then 

n^L{n)+o{n'^^^)L{n)). (3.2.14) 



-, _ m!r(D+l) 
T(v+m) 
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3.2. The Asymptotic Transfer Theorem 



Proof sketch. Whenever the conditions H3.2.1|) are met we have by the ETT and (|3.2.9|l 



1 ^ — h 

-(n + l) = 0(n"-i) + 6„ (n+l)V- 

H,n-V ' ^ ' " H,n-1^ ' {k + l){k + 2) 

k—n 

E^7(A;)[^"]((1-^)"'^7o B(Oil-0''-'dCy (3.2.15) 



m 

^2 ^'^-^ 



where a := 3fi(A2) is the real part of the root of V' with second largest real part, which is smaller 
than 3/2 when m < 26 (see Appendix IB|| . Simple estimates, including the use of (|3.2.8|l . give 
cases (a) and (b); for (b), the coefficient of n^L{n) in H3.2.12|l indeed is, using Identitv IB. II 
and (E 



(1 - mm - 1 • f; ((1 + /3) - A,)^'(A,) m! - (1 + Z?)"-! ■ 
In case (c), from the ETT result (|3.1.3|l and simple estimates we find 



To finish, we use the regular variation fact H3.2.1U|I . 

In case (d), again from (|3.1.3|l and simple estimates we find 

m— 1 ^ 

a„ = 0{n) + 6„ + Ki7i"L{n)m\ V ,, , ^ — + o(n''i(n)). 

^ V'(Aj)(v + l-Aj) 

The proof is completed by using Identitv lB.il □ 
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Chapter 4 

Singularity analysis of Hadamard products 

4.1 Background 

Singularity analysis is "a class of methods by which one can translate, on a term-by-term basis, 
an asymptotic expansion of a function around a dominant singularity into a corresponding 
asymptotic expansion for the Taylor coefficients of the function" js^l- In general, it provides 
sufficient conditions for the validity of the implications 

F{z) = 0{G{z)) =^ /„ = 0(5„) 
Fiz) - o(G(z)) =^ /„ = o(3„) 
F{z) ^ G{z) =^ /„ ~ g„, 

where F and G are the ordinary generating functions of the sequences (/„) and (gn) respectively. 
The sufficiency conditions are expressed in terms of the following analytic continuation condition. 

Definition 4.1.1. A function defined by a Taylor series with radius of convergence equal to 1 
is said to be A-regular if, with the sole exception of z = 1, it can be analytically continued in a 
domain 

A(</., 7^) {z : |z| < 1 + 77, I arg (z - 1)| > (4.1.1) 

for some t] > Q and < < 7r/2. 

A function / is said to admit a IS.- singular expansion at z = 1 if it is A-regular and 

N 

/(z) = 5]c„(l-zr+0(|l-zn (4.1.2) 

n=0 

as z — > 1 in A(0, 77), for a sequence of complex numbers {cn)n=o ^^'-^ ^ sequence of real numbers 
(an)^=o satisfying a„ < A. 

Note that under these assumptions the O(-) error term in (|4.1.2|l holds uniformly in z G 
A(0, 77). We also allow in the usual way infinite asymptotic expansions 

/(z)~^c„(l-z)"" 

n>0 

to represent an infinite collection of expansions of type (|4.1.2|l . When it is convenient, we will 
also allow powers of ln((l — z)~^) in singular expansions. The region of analytic continuation 
in the complex plane is depicted in Figure 1^.1. II We will sometimes refer to such a region as an 
indented crown. 

For completeness we state relevant results from 31] and 28]. These will be used frequently 
from here on. 

Theorem 4.1.2 (Theorem 1 in |3lj ]l. Let f{z) be a A-regular function in A{(j),ri) and assume 
that, as z 1 in A(0, 77), 

/(z) = 0(]l-z]-") 
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4.1. Background 




Figure 4.1.1: The region A{(j>,ri) in Definition 14. 1.11 



for some real number a. Then the nth Taylor coefficient of f{z) satisfies 

fn - [^"]/(^) = 

Flajolet also showed [l^ that singularity analysis is likewise applicable to the generalized 
polylogarithm Jj'ia^riz) defined for a nonnegative integer r and an arbitrary complex number a 
and \z\ < 1 as 

Li„,(z) g ii^z". (4.1.3) 

n=l 

Theorem 4.1.3 (Theorem 1 in [28]). The function Liafi{z) is analytic in the slit plane 
C \ [1, +oo). For a ^ {1, 2, . . .} the function Liafi{z) satisfies the singular expansion 

u^,oiz) ~ r(i - a)r-i + J2 ^^C(a - jW, 

with 

{i-zY 



i = - In z = ^ 



I 

1=1 

as z — !■ 1 in the sector — tt + e < arg (1 — z) < tt — e for any e > 0. 
For r — 1,2, . . the singular expansion of LIq^C-z) is obtained by 

gr 

Lia,r(-z) = (-1)''^^ LiQ,o(2:)- 
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4.2. Singular expansions and differentiation 



The case when a = ?Ti G {1, 2, . . .} is also handled in [isl p. 380]: 

Li™,,(z)=Res((-l)'-C(")(s + m)r(s)t-^),=i_™+ ^ _ j)^.- (4.1.4) 

J>0,i#m-1 

with t = — Inz, where Res(-) denotes a residue. 

Singularity analysis is a very useful tool in the analysis of combinatorial structures mainly 
because generating functions of structures that are constructed combinatorially readily admit 
the technical conditions needed for its application. In such cases precise asymptotic expansions 
of the coefficients can be derived. In the next two sections we will extend singularity analysis 
so that it is applicable to the right-hand side of (|2.3.13|) . 



4.2 Singular expansions and differentiation 

In preparation for the treatment of Hadamard products in Section [4. HI we need a theorem that 
enables us to differentiate local expansions of analytic functions around a singularity. Such a 
result cannot of course be unconditionally true; see, for example, H4.2.5|l . However, it turns out 
that the class of functions amenable to singularity analysis satisfy this property. 

Theorem 4.2.1. // f{z) is A-regular and admits a IS.- expansion near its singularity in the 
sense of Definition \4-.l.l\ then for each integer fc > 0, j^fiz) is also A-regular and admits a 
A-expansion obtained through term-by-term differentiation: 

ri>0 ^ " ' 

Proof. Clearly, all that is required is to establish 

-f0(|l-zn = 0(|l-z|^-i). 
az 

Let us define g{z) :— f{z)/{l ~ z)^ where f{z) — 0(|1 — z\^). By differentiation, we find 

{l-z)A 

This can be rewritten as 

f'[z)^g'{z){l-zf-A^^. 

Now g{z) is by assumption 0{\) at z = 1. If we assume that g'{z) = 0{\1 — z\^^) at z = 1, we 
obtain the sought resuff: f'{z) = 0(|1 - Thus we need only prove the implication 

g{z) = 0(1) =^ g\z) = 0(|1 - z\-'), (4.2.1) 

or, in more symbolic terms, 

-^0(1) = O 

dz 

We are going to prove (|4.2.1|l in a domain A((/) + e, 77), where e > should be fixed but can 
be taken as small as we please. The starting point is Cauchy integral formula 



27ri L, w 
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4.2. Singular expansions and differentiation 



which, through differentiation, provides 




Figure 4.2. f: The contour 7 used in the proof of the differentiation theorem. 



We select a small e > and restrict z to the domain A((/) + e, The integral is estimated 
using the contour 7 depicted in Figure l4.2.fl 7 := 71 U 72 U 73 U 74, with 



71 
72 

73 
74 



^{w:\w~\\^ \\z ~ 1|, I arg(u; - 1)| > 0}, 

= {w: jlz — 1| < — 1|, |w| < l + ?7, arg(t(; — 1) = 0}, 

= {w: \w\ = f + ?7, |arg(u;- 1)| > 0}, 

= {w: i|z — 1| < — 1|, |w| < l + ?7, arg(w — f ) = —<f\. 



We then decompose the integral giving 5' (z) as J — J +J +J +/■ Along 71 we apply 
trivial upper bounds: 



• O (^^) • - z|) = 0(|1 - zr^). 

The integral along the external circle 73 satisfies trivially 

^0{l)^Oi\l-z\-'). 
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(4.2.2) 



(4.2.3) 



If we set |1 — z| = (5, we find that the integrals along 72 and 74 are of no bigger order than 



5/2 



(52 



^L,. 0(1 1^0(11 -.,-■). 
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Thus we liave 



Oi\l-z\-^\ 



(4.2.4) 



Combining the estimates (|4.2.2|I - H4.2.4|1 yields the asymptotic form of g'{z) as 0(|1 — z\ ^), as 

□ 



required at (|4.2.1|l . 

Thus, for instance, taking 

g{z) = cos In 



1-z 



and g'{z) 



1 - z 



■ sin In 



1 - z 



we correctly predict that g{z) ~ 0{1) =^ g'{z) — 0(|1 — z\ ^). On the other hand, the apparent 
paradox given by the pair 



g{z) = cosexp 



1 - z 



and g'{z) 



(1-z): 



■ exp 



1-z 



sm exp 



1 - z 



(4.2.5) 



is resolved by observing that in no nondegenerate sector around z = 1 do we have g{z) — 0{1). 

It is also well known that integration of asymptotic expansions is usually easier than differ- 
entiation, and we can indeed prove the following theorem. 

Theorem 4.2.2. If f{z) has a A-expansion that reduces to f{z) = 0(|1 - z\^\ log (1 - z)\^), 
then we have 

'0(|l-z|^+i|log(l-z)|^) tfA<~l 
0(|log(l-z)|^+i) ifA^-l,B^-l 
f{t)dt = { 0(|loglog(l - z)-i|) ifA^ -1, B = -l 

fit) dt + 0(|1 - z|-^+i| log (1 - z)f) ifA> -1. 

Proof. The A-expansion condition tells us that |/(z)| < K\l — z|'^|ln(l — z)\^ for some < 
K <oo. 

Suppose A < —1. By Cauchy integral formula, we can choose any path of integration that 
stays within the region of analyticity of /. We choose the contour 7 := 71 U 72, shown in 

Figure |32!3 
Then 



fit) dt 



< 



fit)dt 



fit)dt 



fit)dt+ / fit)dt 

'71 •'72 

<K f \1 - t\^\\ii il -t)\^ dt + K f |l-i|^|ln(l-t)|^|di|. 
The last integral along 72 is 

|1 - z\^\ In (1 - z)\^ [ \dt\ = 0(|1 - log (1 - z)n. 

The last integral along 71 is a real integral. Define x :— 1— |1 — z|, so that the last integral 



23 



4.2. Singular cxijuusioiis and (lilR^K^iitialion 



71 



72 



\ 1\ 



Figure 4.2.2: The contour used in the proof of the integration theorem. 



along 7i equals 

r(l-i)^(ln(l-t)-i)^rft 

Jo 

(1 - x)^+\ln (1 - x)-Y -B I (1 - t)^{\n (1 - t)-^)^-^ dt 

Jo 



\A\-1 



We have shown that 



^(1 - x)^+'iln (1 - x)-Y + o - t)\\og (1 - i)-')^ dt^ . 



^ |1 - t\^\ In (1 - dt={l + - a;)^+i(ln (1 - x)-'r 



= (1 + o(l))^^|l - z\^+\\n |1 - 
= {l + o{l))^^^\l-zf+'\\n{l-z) 

= 0(11-^1^+^1 log (1-^)-^)- 



Hence 



/ f{t)dt = 0{\l-z\''+^\\og{l-z)\% 
The case A — —1 is similar. As above, 

j f{t)dt <K I \l-t\-^\\n{l-t)f dt + K I \l - t\-^\\n{l - t)f \dt\. 
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4.3. Hadamard products 



The integral here along 72 is 

|l-z|-i|ln(l-z)|^ / = 0(1 log (1-^)1^) = 0(1 log (1-^)1^+1). 



The integral along 71 is a real integral. If B ^ —1, then setting a; := 1 — |1 — z\, we integrate to 
get 

\l - t)-\\n (1 - t)-Y dt = -^(In (1 - x)-Y+' = Oil log (1 - ^)|^+^). 



If B = -1, 



f{t)dt 



71 



<0(1) + X / (l-i)"'(ln(l-t)"')"'di = 0(|loglog(l-z)-i|). 

1/2 



Combining the two integrals gives us the result. 

The case A > — 1 is obtained by observing that the asymptotic condition guarantees the 
existence of and by decomposing = + . The contour of integration is the line 
segment joining 1 to z. It is enough to show that 



f{t)dt 



= 0(|l-z|^+i|log(l-z)|^). 



Indeed, performing the change of variable t — uz + (I ~ u), we can bound the absolute value of 
this integral by K times 



ll-zl^^M^Iln {u{l- z))\^\l- z\du 

\ . /|ln(«(l-z))r ^ 



^|l-z|^+i|ln(l-z)|^ 
since the last integral is bounded 



ln(l - z) 



du = Oi\l ~ z\^+'\log{l - z)\^) 



□ 



4.3 Hadamard products 

In this section we examine the way singular expansions are composed under Hadamard products. 
The Hadamard product is a bilinear form. So if we have a set of functions admitting known 
singular expansions, we need to establish their composition law, and this will give composition 
rules for finite terminating expansions. In order to extend this to asymptotic expansions, either 
finite with Odl-zl-^) error terms or infinite, we need to establish a theorem providing the form 
of 

o{\i-z\^)QO{\i~zn. 

The composition rule for polylogarithms [see 14.1.3|) ] is trivial, since 

Lia,r(2:) © Li/3,s(2:) = 'Lia+l3,r+s{z). 

However, polylogarithms do not have a simple composition rule with respect to ordinary prod- 
ucts. 

We thus turn to the composition rule for the basis formed by functions of the form (1 — z)"", 
where a may be any complex number. From the expansion 
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4.3. Hadamard products 



around the origin, we get through term-by-term muhiphcation, 

The classical hypergeometric function of Gauss is defined for 7 not a nonpositive integer by 

a(3 z a{a + l)(3{f3 + 1) 



2 



Fi[a,/3;7;z] = 1 + 



7 1! 7(7 + 1) 2! 



so that l|4.3.1|) reduces to 

(1 ~ z)-" (1 - zr" = 2Fi[a, 13; 1; z]. (4.3.2) 

From the transformation theory of hypergeometrics — see, e.g., |37l p. 163] — we know that, in 
general, hypergeometrics can be expanded in the vicinity of z = 1 by means of the z 1 — z 
transformation. If we instantiate this transformation with 7=1, we get 

2Fi[a,/?;l;z] = Jl-^^^l^ 2^^^ [a, /3; « + /3; 1 - z] 

+ ^v(\rm!^ ^^ - ^r"''^' 2Fi[l - a, 1 - /3; 2 - a - /3; 1 - z]. (4.3.3) 

r(a)r(/3) 

In other words, we can state the following theorem. 

Proposition 4.3.1. When a, [3, and a -\- (3 are not integers, the Hadamard product 

(1 - z)-" (1 - z)-'3 

has an infinite /^.-expansion with exponent scale {0, 1, 2, . . .} U {—a — /? + 1, —a — /3 4- 2, . . 
namely, 



fc! +2^^'= k\ 



fc>0 ' fc>0 

where the coefficients A and fi are given by 

^ r(i-Q-/3) 

Til - a)T{l - 13) {a + 



^ r(a + /?-l)(l-a)fe(l-/3)fe 
r(a)r(/3) (2-a-/3)^- ' 

We are thus facing a situation where polylogarithms are simple for Hadamard products and 
complicated for ordinary products, with the reverse situation for power functions. 

We now examine how O(-) terms get composed. The task is easier when the resulting function 
becomes large at its singularity. Fortunately, thanks to the results of Section ^21 all cases can 
be reduced (see Proposition ^^31 to the following. 

Proposition 4.3.2. Assume that f{z) and g(z) are A-regular in Altpo, rj). Additionally, sup- 
pose that 

f{z) = 0(|1 - zl'^) and g{z) ^ 0(|1 - z|'') as z ^ 1, 
where a and b satisfy a + b + 1 < 0. Then the Hadamard product (/ g){z) is A-regular and 
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admits the expansion 

ifQg){z)^Oi\l-zr+'+'). 

Proof. We first give the proof in tlie simpler case when/(2) and g(z) are analytic in the whole 
of the complex plane slit along z > 1. 

Assume that s is a complex number satisfying \s\ < 1. Then consider the integral 

/H.(-)-, (4.3.4) 

taken (counterclockwise) along a contour 70 (see Figure l4. 3. l|l which is simply a circle of radius 
R centered at the origin and such that |s| < i? < 1. Setting 

f{z) = J2 Z"-^" a^'i 9iz) = 3nz", 

n>a Ti>0 

and computing the integral (|4.3.4|l by expanding the functions, we find 

ri>0 

This is the classical formula of Hadamard for Hadamard products. 




Figure 4.3.1: The contour 70. 



We propose to continue / g to a point z such that |z| > 1 where z/ s is real and positive. 
Because of the analytic continuation properties of / and we can take as the contour of 
integration in (|4.3.4|l the contour 7 depicted in Figure 14.3.21 The quantity f & g evaluated at 
s is still equal to the integral form along 7 since there is a continuous deformation from 70 
to 7 such that, along the intermediate contours, both w and s/w stay within the domains of 
analyticity of / and g. 

We now consider the integral in (|4.3.4|) as a function of the parameter s, and let s vary 
continuously from z along the ray from the origin to z. The integrand remains an analytic 
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function of both s and z so that the integral (|4.3.4|l provides the analytic continuation of / g. 
Using this contour, we thus capture the singular behaviors of both / and g. 

In order to obtain the asymptotics for / we fully specify in Figure 14.3.31 the geometry 
of the contour 7. Without loss of generality, we assume that z is in the upper half-plane. 




Figure 4.3.3: The geometry of the contour 7. 



Define 



y := arg z, 
a2 :=Re'^'+*\ 



m := arcsm — , 
^ R 

Pi := z - c5e'^^+'i\ 
f32 := z + c5e^'^+^\ 
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where we can (it is not hard to see) and do choose < c < j sin -00 small enough that 7 
(and its interior) stays entirely within the interior of Aijjjo^rj). (The reason for the restriction 
c < J sin^pQ will become clear later.) 

Then, the contour 7 is 71 U 72 U 73 U 74, where 

71 = {w: \w\ = R, argw e [0, 6* - U [6* + 27r]}, 

72 ^{w:w = (1 -t)ai +t/3i, < t < 1}, 

73 = {w^: I'"' ~ ^1 = arg(w7 — z) G [0, tt]}, 

74 = {w: w = to2 + (1 - t)(32, < t < 1}. 

By the asymptotic conditions on / and g, we know that there are positive constants Ki and 
K2 such that f{w) < Ki\l — w\'^ and g{w) < K2\l — for all w on 7. We also note that 
R < \w\ < 1 + r], so we can find a constant K such that 



l(/0 5)WI = 





1 










< 


-f 




27r \ 



wJ w 



71 •'72 •'Ta •'74 



(4.3.5) 



We now show that each of the four integrals in (|4.3.5II is 0((5°+''+^) = 0(|1 — z|"+^+-'^), completing 
the proof of (|02J). 

1. Uniformly along 71, the integrand is 0(1). Hence the integral is 0(1), which is 0((5"+^+^) 
since a + 5 + 1 < 0. 

2. Along 72, we can express the variable of integration w as 

w = w{u) — {ai — (3i)u + Pi, < u < 1, 

so that 

dw = (ai — Pi)du. 

Note that \Pi — ai\ = (1 + o(l))(l — R). The integral of interest is then bounded by a 
constant times 

/>oo 

\l-w{u)\''\w{u)- z\''du. 

We split this integral into two pieces, one where u < M6, say 721, and the other where 
u > MS, say 722, where M is a suitably chosen large constant. We will show that each of 
these integrals is 0((5°+^+^). To this end, observe that 

1 - u> = 1 - /3i + (/3i - ai)u =l-z + c5e'^^+^^ + u\(3i - ai\e'^ 

= e\e-'^ - \z\ + ic8 + u|/3i - ail), (4.3.6) 

We defer to Lemma [4 .3. 31 a calculation that establishes |1 — = ri((5) uniformly along all 
of 72. To get an upper bound, using the triangle inequality, 

|1 - w| < |l-/3i| + < (5 + c,5+ = (c+ 1)<5+ w|. 

Restricting attention to 721 gives |/3i — w\ < — ai\MS. Hence, uniformly along 721, 
\l-w\^e{S). 

For |w — z|, a trivial lower bound is — z] > \Pi — z\ = cS. Along 721, an upper bound 
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is established by the triangle inequality as 

\w-z\< |/3i - z\ + \w- Pi\<c6+\f3i_- ai\Md. 

Hence |w — z| = 0(6) uniformly along 721, so that the integral along 721 is since 
the integrand is uniformly 8((5°+'') and the path of integration is of length <d{d). 

For the integral along 722, we will establish that |1 — w| — 0(u) and |w — z| — 0(u) for 
suitably large M. The desired bound on the integral will then follow, since a + b + 1 < 0. 

We use (|4.3.6|l and the fact that the real part of a complex number is a lower bound on 
its absolute value to get 

|1 - w| > cos 6' - \z\ + \/3i - ai\u. 
Writing z = 1 + Se^"^ , we note that 

1 + (5 cos lb 1 + (5 cos lb 1 — 5 

cos e = — — - > > > 1 - 2(5. 

\z\ - 1 + 5 -1 + 5- 

Hence 

|1 - w| > 1 - 2(5 - (1 + (5) + (1 + o(l))(l - R)u = (1 + o(l))(l - R)u - 3(5. 

Since u > MS, if we choose M large enough, we get |1 — w\ = n{u). 
For the upper bound, 

\l-w\< \w-l3i\ + \l-f3i\ = |ai-/3i|u+|l-/3i| < {l + o{l)){l~ R)u+{c+l)6 = 0{u). 
Hence |1 — = 8(u). 

The estimate for |w — z\ is similar. For the upper bound, observe that 

w~z — l3i~z + (ai — Pi)u, 

whence 

\w-z\< \(3i -z\ + |ai -/3i|uc(5+ (1 + 0(1)) (1 = 0(m). 

For the lower bound, we express 

w-z = -^cSe^^+^^ + u\ai - Pi\e''^ ^ e'" {u\ai -(3i\- icS), 

so that 

\w-z\ = \u\ai - - ic5\ > u\ai - Pi\ = (1 + o(l))(l - R)u. 
This establishes |w — z| = il{u) and hence |w — z| = Q(u). 

3. Along 73, |w — = c(5, and (1 — c)d = (5 — c5<|l — t(;|<(5 + C(5=(l + c)6 using 
the triangle inequality. The contour of integration has length ttcS. Hence the integral is 

4. Along 74, the argument is similar to (and somewhat simpler than) that for the integral 
along 72 . 

□ 
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We now present the proof of the lower-bound on |1 — w| along 72 used in the proof of 
Proposition 14.3.21 

Lemma 4.3.3. In the notation of the proof of Proposition \4-.!^.^ — f^(^) uniformly along 

all of 72 . 

Proof. If sinO > 2c6, we lower-bound |1 — using H4.3.t)|) and the fact that the imaginary part 
of a complex number is a lower bound on the its absolute value, 

|1 - w| > -Q(e"'''' - |z| -|-ic(5-|-u|/3i - ai|) = sinO ~c5 > cS. 

Otherwise, we use the real part as a lower bound: 

\l-w\> 5R(e"^^ - \z\ + icS + u\f3i - ai|) > cos6l- \z\+u\Pi - ai\. 

We choose c small enough so that if 3?(z) = 1 — |2;|cos6' < 0, then sin 6* > 2cS. Indeed, if 
3f?(z) > 1, then z = 1 + Se""^ where i/'o < V" < 7^/2, and so 

5 sin Tp 5 sin ■00 
sm — — -- — > 

\z\ 2 

and any c < j sin -00 will suffice. So we may assume 

sin 6* < 2cS and 1 - |z|cos6' > 0. 

Now cos^ 9 > 1 — 4c^(S^, whence cos9 = 1 — o{S), and we need only prove that 1 — |z| > 
and l-\z\> n{5). To this end, observe that = \1 - (1 - \z\ cos6')2 -|- (|z| sin6')2, so that 



1 = |z|cos6l + (1 - |z|cos6l) > \z\^/l - Ac^S^ + - Ac'^\z\^ 

> |z|(l-4c2^2)^^(^_4^2|^|2-)^ 

and hence, for all sufficiently small S, 

l-\z\>S{l-Ac^\zf)^4.\z\c^S'^ 

> (5(1 - 16c2) - 80^52 (since \z\ < 2 for (5 < 1) 
>pS, 

where p is a positive constant depending only on tpo, since c < j simpo j- ^ 

Next we show how to modify Proposition 14 . 3 . 2l whenever a + b > —2. 
Proposition 4.3.4. Assume that f{z) and g{z) are /^.-regular and that 

f{z) = 0(|1 - zl") and g{z) = 0(|1 - z\^) as z ^ I. 

(a) Ifk<a + b+ l<k+l for some integer —l<k< 00, then 

k 

if © 5) W - E ^^(/ - + o(|i - zr+'^+i). 



(b) If a + b + I is a nonnegative integer then 

a+b 

I 

J 



if 9)iz) = J2 9)^'^im + o(ii - zr+^+ij iog((i - z)-i)i). 



3=0 
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Proof. Let D denote the operator ^ and let 9 denote the operator zD. 
(a) We first show that 

k 

if 9){^) = E [^'(/ -9)] 1^^)' + - ^1"+"+')- 



(4.3.7) 



The proof is by induction on k. The basis case of fc = — 1 is a special case of Proposition lUT^ 
Observe that 

OifQ9) = iOf)Qg 

and by Theorem [4 . 2 . II that 

{9f)iz)^zfiz)=0{\l-zr'). 
If /c > 0, then by the induction hypothesis 



k-l 



mf 9)] (c) = {9f 5)(c) - E 4^ [^'(^/ f )] ^^(y + - cr+')- 



We divide by ( and then integrate both sides with respect to ( from z to 1 to get 



fe-i 



(/ 9){z) = (/ 5)(1) + E 7-TW [^'^' ® 5)] (1)(- In zY^' + 0{\1 - zr+^+i) 

i=o ^' 

= E 4^ [^'(/ 5)] (1)(- InzF + 0(|1 - zr+^+^), 

as required. 

To obtain the expansion in the asserted form, we use the following two well-known facts: 
a) For fixed nonnegative integers j and fc, as z ^ 1 we have 

k 



{-lnzy^j\J2 



(1-zr 



+ 0(11-^^1), 



where [^] denotes a Stirling number of the first kind (see 1.2.9-(26) in [i^l. 
b) The operators D and 6 satisfy 



61^ r>l. 
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Using these facts in (14. 3. 711 and recalling that a + 6+ l<fc+l,we get 

k k 



J2{-iy[&'{fQ9)] (i)E 

j=0 r=0 

k ,{i-zy 



(i-zY 



+ 0(11-21"+"+^) 



(/0 5)(1) + E(-1)''W^E(-1)''"' , [0'ifQg)]{l) + O{\l-zy+''+') 



,(i-zr 



(/0.9)WH] (l) + 0(|l_zr+''+i) 



r=0 
k 



1=0 ^' 



as desired. 



(b) The structure of the proof is the same as that of part (a). One first proves a result similar 
to H4.3.7|l . Again, the proof is by induction onfc:=a + 6 + l. When fc = 0, after applying 
Theorem 14.2.11 and Proposition 14.3.^ to obtain 9{f g){z) — 0{\1 — z\~^) we can apply 
Theorem 14 . 2 . 21 to get {f Q g){z) = 0{\ log((l — z)~^)\). The induction step is similar to that 
in (a) above. Finally, using the same argument as for part (a), we obtain the expansion in 
the asserted form. 

□ 



For convenience, we now combine Propositions 14.3.21 and 14.3.41 to get a result covering all 
possible values of a and b. 

Theorem 4.3.5. /// and g are amenable to singularity analysis and 

f{z) = 0{\l-z\'') and g{z) ^ 0{\1 - z\'') as z ^ I, 

then f Q g is also amenable to singularity analysis. Furthermore, 
(a) lfa + b+l<0 then 

f{z)Qg{z)^0{\l-z\^+'+'). 



(b) Ifk<a + b+ l<k+l for some integer —l<k< oo, then 



f{z) g{z) = E - + o{\i- ^r+^+i). 

J=0 ^' 



(c) If a + b + I is a nonnegative integer then 



fiz) g{z) = E —j^(f dY'Hm - ^y + - zr+^+i| log ((i - z)-')\y 



3=0 



J 



We will make extensive use of Theorem 14.3.51 in the subsequent chapters. We mention in 
passing that the results of this chapter are also applicable to the analysis of the Cayley tree 
recurrence mill, which models the evolution of a graph from being totally disconnected to 
being tree-like, when successive edges are added at random; see j23] for details. 
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Chapter 5 



Random permutation model: shape functional 

As our first application of the theoretical results of Chapters |31 and 01 we demonstrate how to 
get a complete asymptotic expansion for the mean of the shape functional (see Example 12. 2. 3|) 
for m-ary search trees and we determine the asymptotics of its variance for binary search trees. 
Our results generalize those obtained in [2H . 

5.1 Complete asymptotic expansion of the mean 



As noted in Example 12. 2. HI the shape functional corresponds to the toll function t„ :— In (,^" j^). 
In order to obtain a complete asymptotic expansion of its mean using the ETT fTheorem l3.1.1|l . 
our first task is to obtain a complete asymptotic expansion for its generating function 

00 



The generating function of the toll sequence 

By definition 



where 



and 



But 



n— m— 1 71—711— 1 

00 fm— 2 



In 



E 



n—m—l L ^— 



^ln(n-0 



m — ly 

-[In(m-l)!] ^ z" = P(z) - Q(z), 



Piz)--= E 



n—m—l L ^— 



'm-2 



^ln(n-0 



Q{z) -.^ [In (m - 1)!] ^ z" ^ [In (m - 1)!] 



m— 2 00 



m— 2 00 



m— 2 m— 2 



^W=E E [ln(n-0]^"=E^' E [ln(n-0]^"-'-E E [1^ (n - /)] z" 

m— 2 n— 1 

■Lio,i(z)- ^z"^ln(n-0, (5.1.1) 



1 - z''' 



1 - z 



n=l 1=0 
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where Li denotes the generahzed polylogarithm defined at (|4.1.3|l . The term subtracted in H5.1.1|) 
equals 



m— 2 m— 2 n 

' n 



Ez"lnn!=5:^(-1)'= (1-zfln 



m — 2 m— 2 ^ m— 2 m — 2 ^ - 

= 5:(-lf(l-z)'= ^ ninn!=^(-l)'=(l-zf ^ (;)lnn- (5.1.2) 

fc=0 n=max{fe4} ^ ^ fe=0 n=ft 

while the first term in H5.1.1|) is the product of (1 — z)^^ Lio,i(z) and 

rn— 1 ^ \ m— 1 ^ 

1 - ^"-^ = 1 - E (-1)' ( " ) (1 - = E (-1)'^' ( " r ) (1 - ^)'' 

i.e., it is Lio.i(z) times 

l-z™-i ,i./to-1 



fc=o ^ 
Using Theorem 14 . 1 . 31 we know that 

Lioa(^) ^ t-' Int-' + r'(l)<-i - E ^^C'(-J>^■ (5.1.4) 

as z —s- 1 in a suitable indented crown, with t ;= — Inz. We note that r'(l) = —7, where 
7 = 0.5772 is Euler's constant. 

In order to carry out the integration required in the ETT, we need to express T{z) in terms 
of powers of (1 — z) and ln((l — z)~^). This is our first goal. [For notational convenience we 
will define L{z) :— In ((1 — z)^^).] To this end, expression (|5.1.2|l can be written as 



m—2 m—2 / \ 

J2 Si{k){l ~ z)'' where 5i(fc) (-1)'= E ( J l^"'- 

fc=0 Tl = fc ^ ^ 



Expression H5.1.3|) can be written as 



E^2(fc)(l-^)^ where S^ik) := (-1)' ("^ ^ ^) . (5.1.6) 



fe=0 

For ()5.1.4|l . we will need the following two facts: First, 



^^" = ETl(r,J)(l-^)^ J>-1 where Ti(r, z) | f ^^'^^^ ^ (5.1.7) 



Next, 



Int-^ =Liz) + J2T2ir)il- zY, where r2(r) E ^^^3(^ + 0, (5.1.8) 

r=l 1=1 * 

r3(r,i) := K][L(u;) (5.1.9) 
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The final ingredient in our expression for T{z) is 

771—2 rn—2 m—2n 



III — z lib — z in — z 11 / \ 

= ^ (1 - (1 - = E E u ) ^-^^'(^ - 

n=0 n=0 n=0 ^=0 ^ ^ 

m— 2 771—2,'. m—2 

= J2{i-zr{-irj2[k) = E^3(A:)(i-.f, (s.i.io) 



fc=0 n=fe ^ ^ fe=0 

where 



m—z / \ 

r) — I- ^ ' 



n=fc 

We now express Lio,i(^) in terms of (1 — z): 

Lio,i(^)~ (^E (^L(z) + El^2(r)(l-zrj 

+r'(i) E Ti(r,-i)(i-zr-E^c'(-j)E^i(^'^')(i-^r 

r=-l j>0 r=j 



oo 



= E Ti(r,-l)(l-zrL(z)+ E Ti(r,-l)(l-zrET^2(j)(l-^)^' 

r=— 1 r= — 1 j=l 

oo oo ( \\3 

+r'(i) E Ti(r,-i)(i-^r+E(i-^rE^c'(-jm(r,j) 

r=-l r=0 j=0 *'* 

oo oo oo 

= E Ti(r,-l)(l-^rL(z)+ E Y.^Ar.-^)Ti{m-zY^= 

r=— 1 r= — 1 j=l 



+ r'(i) E ri(r,-i)(i-^r + EW(i-^) 



r=0 



where 



Hence 



E ^^C'(-i)Ti(r,i). (5.1.12) 



OO OO 



Lio,i(^)~ E E E ri(r,-l)T2(i-r)(l-^) 

r=— 1 r=— l_7=r+l 



oo 



+ r'(l) E Ti(r,-l)(l-^r + E^4(r)(l-^) 



r=0 

cx) j-1 



= E Ty{T-\){\-zYL{z)^Y.^\-zy E Ti(r,-l)T2(i-r) 

r=— 1 j=0 r=— 1 



oo 



+ r'(l) E Ti(r,-l)(l-^r + E^4(r)(l--^y 

r— — 1 r— 

: E Ti(r,-l)(l-^rL(^)+ E ^5(r)(l-^r, 

r=— 1 r=— 1 
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where 



'r'(i)ri(-i,i) 



r = -1 



55 (r) := 



r'(l)ri(r,-l) + 54(r)+ J2 Ti{j,~l)T2ir-j) r > 0. 



(5.1.13) 



Now we multiply ()5.1.3|l and (|5.1.4|l , using H5.1.6f) and H5.1.12|l , to get 

Cm — 2 \ / oo oo \ 

m—2 oo m— 2 oo 

= E E 52(fcm(r,-l)(l-zf+'-L(z)+^ 5] 52(fc)55(r)(l-z)'=+'- 



m — 2 oo m — 2 oo 

= E E 52m(r-fc,-l)(l-zrL(z)+^ ^ 52(fc)55(r-fc)(l-zr 

oo min {r+l,m— 2} 

= X](l-zri(z) ^ 52(fc)ri(r-fc,-l) 

k^O 

oo min {r+l,m — 2} 

+ E(l-^)'" E 52(fc)55(r-fc) 

r=-l fc=0 



= J2 S,{r){l-zYL{z)+ J2 S7{r){l-zy 



r=-l 



r=-l 



where 

mill {r+l, 771—2} min {r+l,r77— 2} 

56(r):= S2{k)Ti{r-k-l) and SyW := ^ 52(fc)55(r - fc). 

fc=0 fc=0 

(5.1.14) 

We are now ready to plug H5.1.14f) . H5.1.5|) . and (|5.1.10|) into the expression for T{z). This gives 
us 



T(z)~ ^ 56(r)(l-^)'-L(^)+ ^ 57(r)(l-z)'- 



r=-l 
m-2 



5^5i(fc)(l-z)^--[ln(m-l)!] 



fc=0 



771-2 



(l-z)-i-^53(fc)(l-^)^- 



£ S,{t){1-zYL{z)+ J2 Ssir){l~zy 



(5.1.15) 



where 



'S'7(-l) -ln(m- 1)! r = -l 

Ss{r) := { Svir) - Si{r) + [In (m - l)!]53(r) < r < m - 2 
5*7 (r) r > TO — 1. 



(5.1.16) 
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Application of the ETT 

We now apply the ETT f Theorem 13. For our toll function, 

m — 1 

f{z) ■.^T{z)-J2tnz'^T{z). 

3=0 

Thus, the generating function of the mean of the shape functional, call it Gi, is given by 

G,{z)^T{z) + m\Y^ ^ I / T{C){l~0^^-'dC. (5.1.17) 
We rewrite (j5. 1.15(1 as 

oo oo 

T{z) ^ - ^y^'L{z) + Sio{r){l ~ zy+\ (5.1.18) 

r=-2 r=-2 

where 

S^ir) Se{r + 1) and Sw{r) := Ssir + 1). (5.1.19) 

For a fixed A G C, the terms (1 — z)^^'^L{z) and (1 — z)^^^ are both integrable (resp., neither 
integrable) at z = 1 iff r > - {^{X)] (resp., iff r < -[3fi(A)]). 
We therefore write 

T(C)(l-C)^"'dC 

C=o 

Y r [S^ir)LiC) + SMr)]il - 0^+"- dC (5.1.20a) 



r hc)(l - C)'~^ - E iS,{r)L{0 + 5io(r)](l - 0^+'- 



dC. (5.1.20b) 



The integrand in ((5.1.20b|l is integrable at C = 1 and is asymptotically equivalent to 

J2 [59(r)L(C)+5lo(r)](l-C)'+^ 

r>-m\)] 

as C ^ 1 in the sector — tt + e < arg {1 — () < tt — e for any e > 0. 

For (|5.1.20a|) . we have the following formulas, valid for any a with 'Si{a) > —1: 

(i-CTdC^ — ^[(i-z)"+i-i], 



and 



■(1 - crm dc = — ^(1 - zr+'Liz) - ^^ai - zr+' - 1]. 

a + 1 (a + 1)^ 



In the special case that m is odd and A = Xm-i = and r = m — 1, we also need the formulas 

{I - C)-^ dC = L{z) 
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and 



C=o 



{i-cr'LiOdc^-L^z). 



We now conclude that (|5.1.20a|) equals, except in the special case described above, 



E 

-2<r<-|"sR(A)] 



X + r + 1 



(A + r + l)2 

Sdjr) ^ Siojr) _ Swjr) _ ^^A+r+i 



(A + r + l)2 A + r + l A + r + 1 
In the special case (to is odd, A — m, r = m — 1), H5.1.2Ua|l equals 



(5.1.21) 



E 



A + r + 1 



(l-z)^+'^+^L(z) 



(A + r + 1): 



,(l_^)A+r+i^ 



(A + r + 1)2 



A+r+1 A+r+1 



In the case m is even. 



"-1 n _ ^\-\ 



1 



+ ^Sgim - l)L^{z) + S'io(to - l)L(z). (5.1.22) 



^ V''(A,) 



A=A, 



m — 1 



E ^'(A,) E 

<S'io(r) 59(r) 



^9(0 



A, + r + 1 



(l-z)'-+^L(z) 



Aj+r + 1 (Aj+r + l)2 



il-zY+' + 



{Xj + r + 1)2 Aj + r + 1 



'13, 



where 



[Sii{r)il - zYL{z) + 5i2(r)(l - zf] + Si 

r=-l 



(5.1.23) 



^iiM:=- E 



V''(A,)(A,+r)' 



^i2(r):=- ^ 



j:di{\,)<-r 

1 /59(r-l) , 5io(r-l) 



j:5R(A,)<-r 



V;'(A,) V(A,+r)2 A,+r 



and 



771—1 ^ 

E E 



r=-ldi{\j)<-r 

If TO is odd, we need to add 



g9(r-l) 5io(r-l) 
(Aj + r)2 A, + r + 1 



(5.1.24) 



^ ^ '-59(to-1)L2(^) + ^^o(™_1)L(z) 



V''(-m) V2 

to the right-hand-side of H5.1.23|l to get the left-hand-size of l|5. 1.23(1 
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5.1. Complete asymptotic expansion of the mean 



For (|5.1.20b|l . we define 

MO :=T(C)(l-C)'-^ 



[^9(r)L(z) + 5ioW](l-C)'+^ 

-2<r<~[3i{X)] 



(5.1.25) 



so that H5.1.20bp equals K{\) — / J\{0 dC, wfiere 

Jc=z 



K{X) := / JA(C)dCeC. 



(5.1.26) 



Now we make use of 

r-l 



r>-r!R(A)l 



JxiOdC^ J2 / 59(r)L(C)+5io(r) (l-O^+'-rfC (5.1.27) 



Again, exact integration is possible: for all a e C with 5R(a) > — 1, 



1 



(i-C)"dC = — — (1-z) 

a + l 



a+l 



(1 - 'ic = ^(1 - .)"-i^(^) + (^(1 - .) 



Q + 1 



Therefore 



r>-r3?(A)l 



-A + r + 1 



_ ,)A+^+i + ^ ^^"^(^) ^ (1 _ ,) 



A+r+l 



(A + r + l)2 



A + r + 1 



We can thus conclude 



m— 1 



^V''(A,)^ ^ ^V''(A,) ^.A 



A,- + r + 1 



{l-zr+'L{z) 



Sgir) , Sio{r) 



{Xj + r + 1)2 + r + 1 



(1-z) 



r+1 



m— 1 



E 57T^(l - ^)"'^' + E [^i4(r)(l - z)'-i(z) + 5i5(r)(l - zf] (5.1.28) 

j=l V y j> r>-l 



where 



^14(0 := - E 



j-M{Xj)>-r 



S9ir - 1) 
(A,+r)V/(A,) 



^15(0 := - E 



59(r-l) , 5io(r-l) 



j:K(A,)>-r 



^'(A,)L(A,+r)2 A,+r 
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5.1. Complete asymptotic expansion of the mean 



Case m even 

Using ^rrm . ^rrm . and xrrm m ^rrm . we get 



+ [^-(1 - + Lr{l - zf], (5.1.29) 



r>-l 



where 



K{\) := m\K{\) and M := ni\S 



13, 



(5.1.30) 



m— 1 



Kr := m![5ii(r) + 5i4(r)] + 56(r) = -m!^9(r - 1) 



^ (A,+r)V''(A,) 



+ ^6(0, 



m\Sg{r — 1) 
(-r)"^ - m! 



^6(r) 



(5.1.31) 



and 



L, :=m![^i2(r) + 5i5(r)]+58(r) 

m— 1 



-m 



^9(r-l)5] 



1 



m— 1 



(A,+r)2Vy(A,) 



^io(r-l)5] 



^ (A,+r)^'(A,) 



(5.1.32) 



Case m odd 

In this case we have the same resuh as ()5.1.29|l except that the r — m term must be replaced 

by 



These constants can be obtained in a closed form using Identities IB . II and IB . 21 



(1 - z)" 
^/''(— m) 



Sg{m-l)L'iz) + Si„{m-l)L{z) 



+ Su{m){l - z)"'L{z) + 5i5(m)(l - + S6{m){l - z^Liz) + S?,{m){l - z)" 

= N^{1 - z)^L\z) + K^{1 - z^Liz) + Lm{l - z)^, 



where 



and 



to!S'9(to - 1) 59(to-1) 
2ip'[-m) 2{Hm - 1) 



S'io(m) 



V''(— to) 



+ S'i4(to) +S'6(to), 



L„ := TO!5i5(m) + 58(to). 

(We note that Km and im defined here for m odd are different [compare (|5.1.31|l and (|5.1.r 
from those defined for to even.) The constants can be simplified using Identities IB. 51 and IB 



42 



•j.l. Coiuplcli" asyuiplolic cxpaiibiou of tlw^ iiKniii 



Indeed, 



Su{m) 



E 



Saim-l) 



-S,{m-1) ^ 



(A,-+m)^'(A,) ' ^ (A,+mW(A,-) 



SQ{m - 1) 
2m! 



(2) 



and 



^9(m - 1) 
m! 



(2) 



12 



5io(to - 1) 
2 • m! 



3 - 1)2 4 - 1)3 



(i^m - 1)2 



Some observations 

The following facts can be established for the constants. 
Remark 5.1.1. = Q iov Q < r < m - 2. 



Proof. 



i — /ml ™~ 



(_r)m-l _ to! (_^)m-l _ to! 

which contains the factor OifO<r<m — 2. 

Remark 5.1.2. i^.i = -1. 

Proof. 

(m-1)! S'6(-l) (m-l)Ti(-l,-l) 



(_r)m-l _to!' 



i^_l = 56(-l)-, , ^ 

(m — 1)! — mi 1 — m 
= _Ti(-l,-l) = -[^-i]i(^) = -l. 



Remark 5.1.3. Km-i — for all odd values of m. 
Proof. When m is odd, 

56(m- l)(-l)™-i(m- 1)1 



1 — m 



m— 1 



(-1)™-i(to-1)!-to! 



Seim ~ 1) 
m — I 



□ 



□ 



43 



5.1. Complete asymptotic expansion of the mean 



Now 56(to — 1) is the coefficient of (1 — z)™ ^L{z) in the product H5.1.3|) x (I5.1.4|l . i.e., 
Seim - 1) = [(1 - f iTOl) X (I^THl ) 

= [(1 - zr-'L{z)] -^—{- Inz)-iL(z) 



= [(1- 



1 - z 

1 - 



1 - z 

" ^ H ^L{z) 
To see the last equality, observe that 

1 - (1 - z)™-i 



(-Inz)- 



L(z) 



and 



[z™] / (l-z)^da; = [z"] 

Jx=0 



m — 1 



(1 - z)^ dx 



x=0 



m-1 oo 



x=0 



E 



^ r—z dx = — r 



m — 1 



{~x)"'dx. 



x=0 



But the last integral vanishes because its integrand is antisymmetric about the midpoint of the 
interval of integration: 



-(m - 1 - x)" = (-l)'"(m - 1 - x),n = (-1)" - a;™ = -(-x)™, 
the last equality following from the assumption that m is odd. 
Remark 5.1.4. 



□ 



In (to — 1)! TO 
1^-1 = -H„i-i +7. 

TO — 1 TO — 1 



Proof. 



L_i = ml 



— 771 ' 



H, 



ru — 1 



(to — 1)!(to — 1)-^ 

Hm-1 



(to — 1)!(to — 1)^ 



+ 5io(-2) 

-^8(-l)( 



1 



(to — 1)! — to! 
1 

(to — 1)!(to — 1) 



)] +^8(-l) 



Now 5*6 (—1) = TO— 1 as noted in the proof of R,emark l5.1.2l and 5*8 (—1) — —(m— 1)7— In (to — 1)!. 
Plugging these in the display above, we get the desired result. □ 

The complete asymptotic expansion 

The computations in this section have thus established that the generating function of the mean 
of the shape functional for TO-ary search trees under the random permutation model, denoted 
by Gi(z), has the complete asymptotic expansion 



+ ^i.(l-z)''+ [Kr{l-zyL{z) + Lr{l-zy] (5.1.33) 

r— r>m— 1 
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5.2. The variance for binary search trees 



when m is even and 

m— 1 / \ \ m— 1 

+ iV^(l - zrL\z) + X„,(l - zY^L{z) + - z)" + [^'-(1 - ^)'^(^) + ^-(1 - 

r>rn+l 

when m is odd. 

Using singularity analysis f Theorem 14. 1.2|l . the complete asymptotic expansion can be trans- 
lated on a term-by-term basis to a complete asymptotic expansion of the mean itself, denoted 
by /i„. Thus 

j = l ^ y 0) ■ r>m-l 

when m is even and 

m— 1 



j = l ^ y J' r>m+l 



^"-E^i-^» + ^-i + ^™^"H(^))+^™^'"(e))+ E KrA^m 



when m is odd, where A''(s) denotes the rth-order difference of a sequence s — (s„). Note that 
A'-((l/n)) = e(n-(''+i)) and A™((2i7„_i/n)) = e(n-(™+i) logn). 

The binary search tree 

In preparation for the computation of the variance of the shape functional for the binary search 
trees we state a special case of the complete asymptotic expansion derived above. When m = 2, 
the toll function t„ equals Inn and the indicial polynomial iJjiX) equals A — 2, with root Ai = 2. 
Thus H5.1.34() , Remark 15.1.41 and the well-known expansion for the harmonic numbers 0i 
1.2.7-(3)] immediately yield 

2" 1 

/x„ = K(2)— -H,,~2 + -f + 0(n-^) = K(2)(n 1) - Inn - 2 -I- Ofn^^), (5.1.35) 

n! 2n 

where, by SFTTm . SFTTm . and STTm . 

In k 



/I „i oo oo 1 

_ r(C)(l~C)rfC = 2 ^(lnA:)C'=(l-C)dC = 2^— - 
•''^-^ k=l k=l ^ 

5.2 The variance for binary search trees 



)(fc + 2) 



Let Xn denote the mean of the shape functional for binary search trees under the random 
permutation model. In the sequel it will be convenient to work with the approximately centered 
random variable Xn defined as 

Xn := Xn - Ci(n + 1) for n > 0, 

where Ci := K{2), and to define 

ilnik) := E X^i forfc>l. 
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5.2. The variance for binary search trees 



with fin := 0. Then, foUowing the arguments leading to (|2.3.1|) and (|2.3.2|l . we have 



2 



"(2) = -E^j(2)+r„(2), 



n 



where 



r„(2):= E ^niki)H.ik2) 



fel+fc2+fe3 = 2 jlj2>0 

fci<2, /C2<2 

n— 1 ri— 1 



(hin)2 + 4(lnn)- ^ /2j(l) + " II 

n— 1 

= (lnn)2 + 4(lnn)- ^(/., - + 1)) 



2 



71 

and ro(2) = EXq = Cf. Note that knowing and //n(2) is enough to compute the variance 

of the random variable call it af^, since 

= VarXn = A«(2) - An(l)'. (5.2.1) 

Our proposal is to use the ETT fTheorem again. To that end, let B{z) be the 

generating function of r„(2). Then using the expression for r„(2) above we get 



S(z) = C2 + Lio.2(z) + 4Lio.i(z)0 / (1- 0-^^1(0 -Ci(l-C)-')dC 

+ 2 r [G,(0 - Ci{l - Cr^Y dC (5.2.2) 

Our goal is to get each term in this sum to 0(|1 — z\^'^) for arbitrarily small e > 0. 
Using Theorem 14 . 1 . 31 we get 

Li„,o(^) - r(l - a)e-^ + ^^C(a - j)t\ 
Li„,i(z) ^ r'(l - a)p-i - r(l - a)r-i Int - ^ ^^C'(a " jW , 

and 

Lia,2(2) r"(l - a)t"-i - 2r'(l - a)^-^ hit 

+ r(l - a)t«-i(lnt)2 + ^ ^C"(« - J)i^ 

where ^ 

t= - lnz = (l-z) + ^i^^ + 0(|l-z|3). 
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5.2. The variance for binary search trees 



Hence 

t-i = (l-z)-i-i + 0(|l-z|) 
and [recaUing L{z) :— ln((l — z)^"'^)] 

hit = -L(z) + i(l-z) + 0(|l-z|2). 

This gives 

(Int)^ = L\z) - (1 - z)L{z) + 0(|(1 ~ zfL{z)\), 

so that 

t-\\ntf = (1 - z)-^L\z) - \l\z) - Liz) + 0(|(1 - z)Liz)\) 

and 

Int = -(1 - z)-^L{z) + ^L{z) + 0(1). 
Combining these expansions we get 

Lio,i W = (1 - z)-'L{z) + r'(l)(l - - ^L{z) + 0(1) (5.2.3) 

and 

Lio,2(z) = (1 - z)-^L'(z) - h^z) - L{z) 

2r'(l) [-(1 - z)-'L{z) + iL(z)] + r"(i)(i - z)-i + o(i) 
- (1 - z)-^L^{z) + 2r'(l)(l - z)-iL(z) 

+ r"(l)(l - z)-i - ^L\z) (r'(l) + l)L(z) + 0(1). (5.2.4) 

We also need an asymptotic expansion for the generating function Gi{z) — 01(1 — z)^'^ of 
/2„(1). This is available to us using (|5. 1.33(1 . Indeed, using M = from H5.1.3()|l . Lq = 1/4 
from H5.1.32|l . i_i = —(2-7) from Remark [5.1.41 and Ki = —1/36 from H5.1.31|l [and the 
definitions ^5.1.1ij| . (|5.1.14| . (jCT7|) . ^5.1.i>4|) ]. we get 

Gi(^) - Oi(l - z)-2 = _(i _ z)-ii(z) + L_i(l - z)-i + i - 1(1 - z)L(z) + 0(|1 - z|). 
Thus, invoking Theorem l4.2.2l we get 

^il^Cr'[Gl{0-C^il-0-']dC 
Jo 

= - cr'm + - cr' + cr' j^m + o(i)] dc 

= -[(1 - zr'Liz) - (1 - z)-i] + L_i(l - z)-i + 1l(z) + 0(1) 
- -(1 - z)-'L{z) + (L_i + 1)(1 - z)-i + ii(z) + 0(1), 
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5.2. The variance for binary search trees 



and 

[Gi(z) - - z)-2]2 ^ (1 _ zy^L'iz) - 2L_i(l - z)-^L{z) + L\{\ - z)-^ 

- \{l ^r'Liz) + - + 0(|L2(z)|). (5.2.5) 

We now integrate l|5.2.5(l term by term. Integration by parts establishes 

■(i-cri'(c)dc = 

^ -(1 - zY+'L\z) -^(1 - zY+'Liz) -^[(1 - zy+' - 1] (r ^ -1) 



r + 1 (r + 1)2 ' (r + 1)3 



and 



f — (l-z)'^+^L(z)- , ^ ,J l-zY+^+ , V, (r7^-l) 

(i-C)^MC)rfC= / + ! + + 

[2^^'^ (^ = -1). 

By Theorem 14 . 2 . 21 the integral of the remainder in H5.2.5|l is 0(1) = 0(|1 — z]"*^). Hence 



r[Gi(c)-ci(i-c)-']'dc 

Jo 



= (1 - z)-^L\z) - 2(1 - z)-^L{z) + 2(1 - z)-i 

- 2i_i[(l - z)-iL(z) - (1 - z)-i] + L^i(l - z)-i + 0(|1 - zj-') 
= (1 - z)-1l2(^) _ 2(L_i + 1)(1 - z)-^L{z) + (2 + 2L_i + L^i)(l - z)-^ + 0(|1 - z]"^). 

(5.2.6) 

We now have an asymptotic expansion for each term in the sum ifFT^ but the one involving 
the Hadamard product. We make use of the following facts. 

1. (1 — z)^^ is the identity element for Hadamard products, i.e., (1 — z)^^ f{z) — f{z). 

2. L{z) = 0(|1 - z|-") for any e > 0, so that (1 - z)"^£(z) 0(|1 - z\-^-''). Hence, by 
Theorem 031 [(1 - z)-^L{z)] 0(1) = 0{\l - zY"). 

3. Similarly, 

[(1 - z)-^L{z)] L{z) - 0(|1 - z\-'-), L{z) L(z) - 0(1), 
L(z) 0(1) =0(1), O(1)0O(1) =0(1). 

4. The following lemma will also be used. 

Lemma 5.2.1. [(1 - z)-^L{z)] [(1 - z)-^L{z)] = (1 - z)-^L^{z) + (1 - z)-^ Li2,o(z). 
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5.2. The variance for binary search trees 



Proof. Let iJ„ denote the nth harmonic number. Then 
[(1 - zr'L{z)] [(1 - z)-iL(z)] = ^ i/^" = E f E ^) ^'^ 

n>\ \k=\ / n>- 



E E J- 



^"-E^ E^" U^E 

A;>1 \n>/c / 



E 



n>l \_k=2 



k>l 



n>l 



E 

.k=2 



Hk-1 



The first sum is (f — z) ^ Li2,o(2). For the second sum, observe that 



EE 



n>l 



Hi 



fc-i 



.k=2 



fc>2 n>k k>l 

= (1 - - c)-iL(C) rfC = ^(1 - ^)-^i'(^), 



and the claim foUows. 
In Ught of these facts we can now conclude that 

Lio,i(z) Hi - C)-MGi(C) - C^{^ - dC 
Jo 

= [(1 - zy'Liz) + r'(i)(i - - hiz) + o(i)] 

[-(1 - z)-ii(z) + + 1)(1 - z)-i + hiz) + 0(1)] 
= - [(1 - z)-'L'{z) + (1 - Li2,o(z)] + (i-i + !)(!- ^)-'i(^) 

- r'(i)(i - z)-'L{z) + r'(i)(L_i + i)(i - + - zr). 

To get an asymptotic expansion for Li2,o we use H4.1.4|) : 

2 2 

Li2,o(z) - t{\nt -Hi) + ^ + 0(|1 - zp) = !L _ (1 _ ^)L(z) + 0(|1 - 2 

6 6 

Hence we see that the Hadamard product 15.2.7|) has the asymptotic expansion 



□ 



(5.2.7) 



- (1 - z)-'L^z) - (1 - z)-' (y - (1 - z)Liz) + 0(|1 - z|)) 

+ (L_i + 1 - r'(l))(l - z)-'Liz) + r'(l)(L_i + 1)(1 - + 0(|1 - z\-') 

- -(1 - zr'L^z) + + 1 - r'(l))(l - z)-'L{z) 



+ 



r'(i)(L_i + 1) - -J (1 - z)-' + o(|i - zr). 



(5.2.. 
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5.2. The variance for binary search trees 



Using (j^njl . (EHSl, and in f^T^ . we get 

= C2 + (1 - z)'^L''{z) + 2r'(l)(l - z)-iL(z) + r"(i)(i - 



-(1 - z)-'L\z) + + 1 - r'(l))(l - z)-^L{z) + [r'(l)(i_i + 1) - (1 - 



+ 2 (1 - z)-^L^{z) - 2(L_i + 1)(1 - z)-^L{z) + (2 + 2L_i + Li^){l - z)-^ + 0(|1 - z]"^) 
= -(1 - z)-1l2(z) + - z)-iL(z) + ^2(1 - ^)"' + 0(|1 - (5.2.9) 
where 

-2r'(l) = 27 (5.2.10) 

and 

2 2 
^2 r"(l) + 4(r'(l)(L_i + 1) - + 2(2 + 2L_i + L^j) ^ 4 - 7^ - ^. (5.2.11) 

Let B{z) := B{z) - ro(2) = B{z) - By direct apphcation of the ETT, A{z), defined to 
be the generating function of (/2„(2)), is given by 

A{z) = ci(l - z)-2 + B{z) + 2(1 - z)-^ r B(C)(1 - C) dC, 

Jo 

where ci = ?'o(2) = . The integral here can be broken up as 

B{0{^-OdC = v- /'s(C)(i -C)rfC, 



where the first term on the right is the constant 

rfe(2) 



/ E'-'=(2)cM(i-c)dc = E- 

•^0 Vfc=i / k=i 



(fc + l)(fc + 2)' 

For the second term we use the following consequences of integration by parts: 



1 

2 



L^{C) dC = (1 - z)L^z) + 2(1 - z)L{z) + 2(1 - z), 



i(C) dC = (l-z)L(z) + (l-z) 



and the crude remainder estimate 



o{\i-cr^)dc^o{\i-z\'-^) 
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5.2. The variance for binary search trees 



Thus 



i?(c)(i-c)dc 

J z 

= (1 - z)-'^[-{l - z)L^{z) - 2(1 - z)L{z) - 2(1 -z) + Vi{l - z)L{z) 

+ Vi{l-z) + V2{l-z) + 0{\l-z\^-^)\ 
= -(1 - z)-^L\z) + (Fi - 2)(1 - z)-^L{z) + (Fi + - 2)(1 - z)-^ + 0(|1 - zp^), 

and 

A(z) = Cl{l ~ z)-2 - (1 - z)-1l2(z) + Vi{l- z)-^L{z) + V2{1- z)-^ + 0(|1 - ^1"^) 
+ 2\V{1 - + (1 _ z)-^L\z) - {Vi - 2)(1 - z)-^L{z) 

~ (Fi + V^2 - 2)(1 - + 0(|1 - zj-^)" 
= (Cl + 2V){\ - + (1 - zy^L'iz) + (V^i - 2{Vi - 2))(1 - z)-^L{z) 

+ (Fa - 2(T/i +V2- 2))(1 - + 0(|1 - 
= (C^ + 2V){\ - + (1 _ z)-^L\z) + (4 - T/i)(l - z)-iL(z) 

+ (4 - 2Vi - V2){\ - z)-^ + 0(|1 - z]-^). (5.2.12) 

We can now transfer this asymptotic expansion for the generating function to one for its 
Taylor coefficients, invoking Theorem 14 . 1 . 21 for the remainder estimate. We know that 

[z"](l - z)-^L{z) = H„ = Inn + 7 + 0{n-^) 

and, from the proof of Lemma 15.2.11 

[z"] [(1 - z)-'L\z)] =Hl-Y,-= (\nn + 7 + 0{n-')) - ^ + 0{n-') 



k=l 



k=l 



In Ti 

{\iinf + 27lnn+ ( 7^ - — 1 + 0( 



So, by singularity analysis (Theorem 



/i„(2) = {Ci + 2V){n+l) + (Inn)^ + 27lnn+ 7 



+ (4-T/i)(lnn + 7) 



+ (4 - 2Vi ~ V2) + 0{n-^+') 
In order to obtain the variance using H5.2.1|) we also need [recall 15.1.35|l ] 
jln{lf = (ln?i)2 + 41nn + 4 + 0(n-i+'). 
Hence, using H5.2.10() and H5.2.11|l we obtain an expansion for the variance: 



(5.2.13) 



al = (Ci2 + 2y)(n+l) + (27-l/i)lnn + 



2 

(7'- y) +(4-^^1)7-2^1-^2 



0{n- 



= {Cf+2V){n + l) - 4 - — +0(n-i+^). 



(5.2.14) 



Numerical calculations using Mathematica suggest that the next term in the expansion is —it-- 
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One could possibly obtain this by carrying out the program of this section, keeping track of 
more terms in the asymptotic expansions. 
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Chapter 6 



Random permutation model: limiting distributions 

In this chapter we wiU apply the Asymptotic Transfer Theorem (Theorem 13.2. 1|) of Chapter |3| 
to derive hmiting distributions for additive functionals defined on m-ary search trees under the 
random permutation modeL Our main resuhs are Theorems 16. 1.1116.1. 41 IS. 2. 21 and 16. 2.^ 
Throughout cr'^) denotes the normal distribution with mean fi and variance a^. 



6.1 Small toll functions 

We begin with two central limit theorems f Theorems 16.1.11 and 16 . 1 . jjl for "small" toll functions. 
Throughout, we write as shorthand for the sum over m-tuples (ji, . . . ,jm) of nonnegative 
integers summing to n — {ni — 1). 

Theorem 6.1.1. //2 < m < 26 and the real toll sequence satisfies 

(a) tn = o(\/n) and (b) } max — < oo for some < S < 1, (6.1.1) 

n^<k<n k 

then the mean /i„ and the variance cr^j of the corresponding additive functional on m-ary 
search trees with the random permutation model satisfy, respectively, 

Ki 

^« = 77 7n + o{^/n) -. fin + o{^/n), (6.1.2) 

J^m — 1 

with Ki defined as 

°° t- 

and 

^ oo 

al . + o[n), where := g (TTIKTT^' ^''■^■'^ 

with the sequence (r„) defined by rj :— for < j < to — 2 and 

'Tn ■= , n s y^jtn + Mji H ^Mi^-^n]^, n>TO-l. (6.1.5) 

Vm-lj i 



Moreover, 



and there is convergence of moments of every order. 

Proof. One can check (for any 2 < to < oo) that the variance cr^ vanishes for all n > to — 1 if 
and only if the toll sequence is chosen as 

tn = tmm{m — l.n}, n>0 
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for some constant t e M. In that case without loss of generality t — 1 and then X„ = n is just 
the number of keys and we have exact (though degenerate) normality. So we shall assume that 
a^>0. 

We use the method of moments together with the asymptotic transfer results of Section 1^?^ 
Given the toll sequence (i„) defining the sequence (X„) of random functionals of interests, 
the means (/i„) satisfy precisely the recurrence (|2.3.3|) with (6„) replaced by (t„). Thus H6. 1.2(1 
simply repeats the asymptotic transfer result H3. 2.1111 . 

According to the law of total variance, the sequence (cr^) also satisfies the recurrence (|2.3.3|l . 
but with (6„) replaced by (r„). Furthermore, according to Lemma [6. 1.21 to follow, the sequence 
(r„) satisfies the condition (|3.2.1|l . Then (|6.1.4|l is immediate from part (a) of the ATT (Theo- 
rem EHIP- 

Let Xn Xn — fj-(n + 1) for n > 0. We will complete the proof by showing by induction on 
k that 

/i„(A:) EX^, fc>l [with /i„(0) := 1] (6.1.6) 

satisfies 

fin{2k) ^ ^a^'^n'', and A«(2fc - 1) = o(«'"^), k>l. (6.1.7) 

Observe that (jOT^ and ^^T^ imply that (jOTjl holds for fc = 1. 

The key to the induction step for (|6.1.7|1 is to apply the law of total expectation to (|6.1.6|l . 
by conditioning on the principal subtree sizes . . . , \ Lm\- Letting denote the sum over 
(m + l)-tuples (fci, . . . , fcm+i) of nonnegative integers summing to k, for n > m — 1 we have in 
a manner identical to that for H2. 3.1(1 . 

n-(m-l) , _ 1 _ A 
[ni-l) j=0 ^ / 

where 

with ^jj. denoting the same sum as with the additional restriction that ki < k for i ~ 
1, . . . , m. Observe that ((6.1.8(1 is again of the same basic form 1(2. 3. 3() . We will apply the ATT 
after evaluating r„(fc) asymptotically. 

We will treat the induction step in detail only for /i„(2fc) at ((5.1.7(1 . the case for /i„(2fc — 1) 
being similar and somewhat easier. Let X^k* denote the sum over m-tuples (fci,...,fc„i) of 
nonnegative integers, each strictly less than k, summing to k (i.e., the same sum as with 
the additional restriction that fcm+i = 0). For fc > 2, we clearly have, by induction, 

r„(2fc) = o(n^) + ^ j__^^^.^(2/ci).../7,„(2/c„) 



2k \ 1 ^ (2fc)! -k, (2fc™)! 



°^"^+2^ Ufc ^k^^) ( n ^2^2^,^^f 

\m-l) j 



2/c,„ -kr, 



n \ n 
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But 



Vm — 1/ 



(m - 1)! j .T^ • • • .T^r_i\l - XI 

,r(fci + i)---r(fc„ + i) 



(m- 1)!- 



Xm-l) dxi - ■■ dXm-l 
1 



where the above integral is over (xi, . . . , Xm-i) € [0, 1]™ ^ with sum not exceeding unity. Since 
the number of terms in is C^^'^^^) ^ therefore have 



, . fc^ (2^)! 2k k 



_^ m!r(fc + l) 

r(fc + to) 



V m-1 y 
Similarly, 

Tni^k-l) ^ o{n'2)^ k>2. 
Now part (c) of the ATT (Theorem EHUl imphes ^TT^. 

The following lemma lies at the heart of the proof of Theorem 16. 1.11 



+ o(n'=), k>2. 



□ 



Lemma 6.1.2. In the context of Theorem \f).l.l[ the sequence (r„) defined at (|6.1.5|l satisfies 
the conditions (|3.2.1|l . 



Proof. Clearly 



with 



\m-l) j 



n > TO — 1 , 



Mri - + 1), 
which is o(^/n) by (|6.1.2|l . Recall the inequality 

,. n 2 



< 



(6.1.9) 



(6.1.10) 



(6.1.11) 



for real numbers ^i, • • ■ , ^/e- Applying this to H6.1.9|l . 

n— [m—l) 

<tn + /i„ + -pr-y 2^ 

Vm— 1/ j=0 



TO- 2 K^' 



(6.1.12) 



from which it is clear that r„ = o{n). 

To establish the summability of rn/ri^ we need only establish that of ji'^/n'^ . Indeed we may 
then use H6.1.12|) again, together with the fact that (|6.1.1|l fb'l implies 



E% 



< oo 



(6.1.13) 
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and the estimate 



E 



n— (m— 1) , > 

TO ■^-^ fn—l — j\^2 



( n ^ E 
vm— 1/ J— 



TO — 2 



= TO(m - 1) XI E 

j^O n—j-\-m— 1 
oo oo 



(n - 1 - jY- 



<to(to-i)Xa? E --^ = E^ < 



oo. 



J' 



j—0 n— j+'Ti— 1 \ 

To establish the summabihty of jl^/'n? ^ we recah from (|3.2.15|) and H3.2.8|) that 



(fc + 



(6.1.14) 

writing aj := 5R(Aj) (with a — a2 < 3/2, since m < 26). Using (|6.1.11|) . we need only establish 
the summability of times the square of each of the four terms on the right in H6.1.14|l . The 
first of these verifications is trivial, and the second was carried out at H6.1.13|) . For the third we 
apply the Cauchy-Schwarz inequality to give 



H 2 



E 



(fc + l)(fc + 2) 



< 



[k + l){k + 2)^ 



H 2 



\ k—n / \ k—n / 



whence 



E 



E 



tk 



(fc + l)(fc + 2) 



by H6.1.13|) again. 

We pause to note that when to = 2 the proof is finished here, and that up to now we have 
used only (|6.1.13|) . not the stronger assumption l|6.1.1|) fb'). 

For our fourth and final verification, it suffices [again by invoking (|6.1.11f) ] to establish the 
summability of 



2p-4 



E 

.fc=l 



kP 



(6.1.15) 



for any real p < 3/2. To do this we break the sum into J2k<n'' ^^'^ ^n^<k<n ^'^^ once again 
invoke H6.1.11|l . In the range X]fc<n* simply use tk = 0{Vk) and note that 



.2p-4 



Y^o[k 

k<n^ 



1-P 



56 



6.1. Small toll functions 



with r < — 1. In the range 



.■5<fc<r 



we use Cauchy-Schwarz again: 



E\m_ 
kP 

n^<k<n 



E 



k2~' \tk\ 



n^<k<n n2 
1 



3 pA:i/2 



3 p k 



= O 



-1 "-i- 

rt max 

n^<k<n k 



which is sumniable by assumption Ht).l.l|) fb'). 



□ 



Remark 6.1.3. (a) We have already noted in the proof above that condition H6.1.1|l fb) trivially 
implies 



°° ,2 



< oo, 



(6.1.16) 



which in turn implies that H3.2.1|) holds with absolute convergence; indeed, since the non- 
negative numbers [(n + l)(n + 2)]^^, n > 0, sum to unity, we have 



E 

n=0 



\tn\ 



(n + l)(n-|-2) 



< 



E 

n=0 



< oo. 



(b) If 



oo ~2 



|t„| = 0{i„) with < ^ i and y ^ < 
then we claim that H6.1.1I) holds, and then as a corollary 



oo. 



(6.1.17) 



(6.1.18) 



^iO and Y.h< 



oo 



implies (|6.1.1|l . To see this claim, first observe that the condition l|6. 1.18(1 implies ((6.1.1|l fa): 
moreover, we observe that the series (say, over 2 < n < oo) in H6.1.1|) fb) is bounded by a 
constant times 



n=2 



max — 

,^<k<n k 



E^ 



t 



E^ E 



"-We 



fc=2 (fc-l)i/«<„<fci/* 



\k=2 



1 
fc2 



< OO. 



(c) We have also observed that, when 771 = 2, the proof of Theorem l6. 1 . ll reauires only l|6 A. Ij) (a) 

and H6.1.16|l . In that case we obtain a strengthening of "Case SI" of Theorem 2 in (Sg (for 

1 

deterministic toll sequences); they required t„ = 0(-y77/(ln7T,) 2+^) for some e > 0. 

We can also obtain asymptotic normality in the "borderline small" case. The definition of 
slowly varying is available as Definition l3.2.6l 

Theorem 6.1.4. If 2 <m < 26 and the real toll sequence (t„) satisfies 

tn ~ VnL{n) (6.1.19) 
with L slowly varying, then the mean fj,n of the corresponding additive functional Xn on m-ary 
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search trees with the random permutation model satisfies 



^" ^ TT^" - , + o{V^L{n)), (6.1.20) 

tim - i m! - (3/2)™^^ 



with Ki defined at (I6.1.3|l . 



i. If^°° ~~F^ ^''■S'^ variance af^ satisfies (|6.1.4|l -( |6.1.5|l and we de/s 



Tie 



2. IfYT^^oo, then 



k 

al^s'{n):=a'nJ2^, (6.1.21) 



k 

k<7i 



where in this case we define 



^2 ((3/2)— i)^[f(m-l) + l]-(m!)^ ^ ^ 

(i7™-l)[m!-(3/2)'"-i]2 

Moreover, in either case 

^Vf ^AA(0,1) 
s(n) 

awd f/iere is convergence of moments of every order. 

Proof sketch. Given the similarity to the proof of Theorem 16. 1.11 we will be brief here. Again 
we use the method of moments together with asymptotic transfer results. 

Equation (|6. 1.20(1 simply repeats the transfer result (|3.2.12|l . As before, (cr^J satisfies the 
recurrence (|2.3.3|l with (6„) replaced by (r„) of (|6.1.9|I - H6.1.1U|I . where again fi :— Ki/{H„i — 1) 
and rj := for < J ^ m— 2. Here the proofs diverge somewhat. The analogue of Lemma l6 . 1 . 2l is 
Lemma l6.1.5l below. Then the asymptotic variance assertions of the theorem follow immediately 
from Theorem 13 . 2 . Tf c *) . 

If ^^T^ < then from l|3.!^.10|l apphed to it follows that l^T^ satisfies ifHTTjl for 
fc = 1. Then higher moments are treated exactly as in the proof of Theorem 16 . 1 . II to complete 
the proof. 

liJ2°°LlM= oo, then one uses (|3.2.1U|I . Theorem 13 . 2 . Tr dl . and induction to show that the 
moments ()6.1.6|l satisfy 

finm^^s^'^in) and /i„(2fc - 1) = o(s2'=-i (n)), k>l (6.1.23) 

and thereby complete the proof. We omit the details. □ 

The following cousin to Lemma 16.1.21 was used in the proof of Theorem 16 . 1 . 41 above. 
Lemma 6.1.5. In the context of Theorem \6.1.4\ the sequence (r„) defined for n > m — 1 by 

\m-l) j 

satisfies 

rn ~ {Hm - l)(T^nL^(n). 
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Proof. By (|6.1.!^0| . with 6 (3/2)"-V(m! - (3/2)'"-i), we have 



nL'^[n){m — 1)! 



1/2- 



1/2 



dxi ■ ■ ■ dXjn-l, 



where the integral, call it J, is over (xi, . . . , Xm-i) G [0, with the sum not exceeding unity. 
To complete the proof we need only show J = {Hm — l)^^/ (m — 1)!, with defined at ()6.1.22|l . 
Indeed, J equals 



(1 + 9)' 



rn— 1 

2^1 



7rL— 1 



1/2 



1- -20(1 + 0)^ 



1/2 



29(1 



1-E 



\ 1^1 

/ rn— 1 

E ^l/\^/V2.^ i-E^ 



1/2. 



2 a;i/2-i/2 

1 



E 



1/2 



1=1 



dxi • • • dXm~l 



= [(1+0)2+^2 



(m - 1)! 
1 



_ 20(1 + 9)m ^^2) ^ ^2^(^ _ ^) [r(3/2)]; 

^ ^ r(m + i) ^ ■'r(m + i 

1 



20(1 + 0)m 



(to + 1) 



7r/4 



(m-1)! "'^ ' '■'■■^(3/2)"-i (to-2)!' 
Plugging in the value of and simplifying, we obtain J = (i?,„ — 1)0-2/(771 — 1)1^ as desired. □ 

Remark 6.1.6. When to = 2 the constant in (|6.1.22|l equals |7r — 14, and Theorem 16.1.41 
reduces to "Case S2" of Theorem 2 in 32j (for deterministic toll sequences): see especially their 
displays (15) and (17), with T2 = 1. 

Periodicity for m > 27 

If t„ — o{y/ri) as in Theorem 16. 1 . II but m > 27, then the remainder term /i„ := ^„ — m("- + 1) 
for the mean — which by H6. 1.2(1 was o{y/n) when m < 26 — now satisfies, by the ETT (Theo- 
rem EUl) and IpTT^ [compare l|3.2.15|l ] 



,A2-1 



r(A2 



C3 



n^3"i , 1 

U ml 



r(A3) 



J=2 



+ o(\AI) + 0(n^(^*)-i). (6.1.24) 



Typically this will lead to the negative result that (/i„) [and hence also (r„) and (o-2)] suffers 
from periodicity and that there is no natural distributional limit for normalized X„. See, e.g.. 
Corollary 1 of [g. 

But it is perhaps difficult to establish a general negative result, due to cancellations. For 
example, suppose T(z) equals (1 — z)~^, so that i„ = 1, as studied by Chern and Hwang 
except perhaps that the initial values to, ... , tm-2 are changed. Then 



r(™-i)(z) = (m- 1)!(1 -z)- 
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whence 



m— 1 



rn— 1 ^ m— 1 \ \ ■ 



(l-z)- 



Now it is possible to choose Iq, . . . , tm-2 so that 

_ (m-l)! 



(i-A,)^'(A,r 



j = l,...,m-l. (6.1.25) 



In that case A{z) = ^ — 2)^^, whence /x„ = = — 1/(to— 1) [and we see that the chosen 
values oito, ■ ■ ■ ,tm~2 are all — l/(m— 1)], and we get linear variance and asymptotic normality, 
just as in Theorem 16. 1.11 for every 2 < m < 00. 

One might object that the above example is artificial, in that the toll sequence changes sign. 
But the same calculation show that if the toll sequence is chosen as above (t„ = 1) but with 
inital values 

tj:=K{j + l) 3— , 0<j<m-2, 

m — 1 

then still, for every to > 2, the sequence (p-n) is constant, the variance is linear, and we have 
asymptotic normality. Further, (<„) is nonnegative provided K > l/(m— 1). [We remark in 
passing that the choice K — 1/(to — 1) leads to the degenerate case mentioned at the beginning of 
the proof of Theorem l6.1.1l ] The sequence (t„) is also nondecreasing (as in most real examples) 
provided K < to/ (to — 1)^. 



6.2 Moderate and large toll functions 

In order to describe the limiting distribution of X„ for moderate and large tolls, we will introduce 
a family of random variables F = defined for /3 > 1/2, /3 7^ 1. Anticipating Lemma [6. 2. II we 
need to consider the distributional equation 

m 

r^E^f^^ + i (6.2.1) 

Here (i^)jLi are independent copies of Y and {Si, . . . , Sm) is uniformly distributed on the 
(to — l)-simplex, independent of {Yj)^^^. Recall that the (to — l)-simplex is the set 

{(si, . . . , Sm) ■ Sj > for 1 < j < m and s_|_ = 1}, 

where s+ denotes J^jLi Sj- 

Let C/(i), . . . , J7(m-i) be the order statistics of a sample of size to — 1 from Unif(0, 1) with 
joint density 

/t/(i),...,;7(„_i)(a;i, . . - ^Xm-i) = {m - 1)! 1(0 < xi < • • • < x^-i < 1) 

with respect to Lebesgue measure on M™~^, where 1(A) is the indicator of A. 

By a change of variables, we find that the joint distribution of the spacings Si, ... , Sm, 
defined, with [/(q) := and ?7(m) '■— li by 

Si := U^i) - , i = 1, . . . , TO, 
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is uniform over the (m — l)-simplex: 

/si,...,S™-i(si,---,Sm-i) = (m- l)!l(sj > 0,j = l,...,m- 1; s+ < 1) . 
When rj > — 1 for 1 < j < m, observe that 



E 



(m- 1)! 



si ,...,s„_i>0 
s+<l 



- S+)''" dsm-1 ■■■dsi 



(m-l)!B(ri + l,...,r„. + 1) 

n™ir(r, + i) 



(6.2.2) 



(to - 1)! 



r(ri H 1- + to) ' 



:= E y = 1 



Lemma 6.2.1. fii /3 > 1/2 mt/i /3 ^ 1. Then there exists a unique distribution C{Y) = C{Yfj) 
with finite second moment satisfying the distributional identity H6.2.1() . 

Proof. We first observe the that mean of any such distribution is determined by 16.2.1(1 . Indeed, 
by taking expectations in H6.2.1(l and using (|6.2.2|l . we get 

TO!r(/3+l) 
r(/3 + TO) 

since /3 7^ 1. Thus we can equivalently consider the distributional identity 

m 

where 

m 

i7:=l-At + ME^f- 

Here W is restricted to have mean and finite second moment, (Wj)JLi are independent copies 
of W, and (5*1, . . . , S^) is uniformly distributed on the (m— l)-simplex, independent of (Wj)"ij. 

We now employ a standard contraction-method argument Issj . Let ^2 denote the metric 
on A^2(0), the space of probability distributions with mean and finite variance, defined by 

d2(Gi,G2) :=min||X2-Xi||2, 

taking the minimum over all pairs of random variables Xi and X2 defined on a common prob- 
ability space with C{Xi) = Gi and C{X2) — G2- Here ||-||2 denotes L2-norm. 
Let T be the map 



r:7W2(0)^7W2(0), G^C^^S^Xj+Hj , 

where {Xj)JLi are independent with C{Xj) = G, j = 1, . . . , to, and (5*1, . . . , Sm) is uniformly 
distributed on the (m — l)-simplex, independent of {Xj)"'^^. We show that T is a contraction 
on A^2(0); more precisely, that there exists a p < 1 such that 

d2{T{£{A)),T{£{B))) < pd2{£{A),£{B)) 

for all pairs £{A) and £{B) in 7^2(0). To bound d2{T{£{A)),T{£{B))), we couple T{£{A)) 
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and T{£{B)) by taking m independent copies {Aj.Bj) of the (i2-optimally coupled {A,B), an 
independent {Si, . . . , Sm), and defining 

m m 

^' ■= J2 ^j^^i +^ ^ T{£{A)), B' := ^ S'l'Sj + ~ T{C{B)). 

Now, defining S := (5*1, . . . , 5*^) and using the law of total variance, 
d2{C{T{A)),C{T{B))f 



< \\B'-A'\\l 



E Var 



i=i 



Var 



Var E 



J2s^iB,^A,) 



E(E5f )Var [B, - A,] = d^idA), dB))' Y.^ S: 



2/3 



, r(2/3 + i) 

'r(2/3 + m) 



We need only verify that 



:= m! 



r(2/5+ 1) _ rn! 

r(2/3 + m) ^ (2/3 + m-l)--- (2/3+1) 



< 1, 



which is true when /3 > 1/2. The existence and uniqueness of C{Y) now follows from the Banach 
fixed point theorem ■65', Theorem 2]. □ 

Moderate toll functions 

In the case of moderate toll functions, convergence in distribution and convergence of all mo- 
ments can be stated as 

Theorem 6.2.2. If the real toll sequence (t„) satisfies 

tn ~ nP, 1/2 < /? < 1, 

and a < \ + [3, then the mean of the corresponding linear functional on m-ary search trees 
with the random permutation model satisfies 



Pn = A*" 



(1+/?)" 



^^n*^ + o(n'^), p 



m!-(l+/5) 
with Ki defined at (I6.1.3|l . Moreover, 

Xn - pn c 

with convergence of all moments. 



Hm — 1 



(6.2.3) 



Proof. We use the notation introduced in the proof of Thcorem l6.1.1l Equation (|6.2.3|l is simply 
a restatement of the asymptotic transfer result H3.2.12|l . 
We show that the moments Pn{k) satisfy 



Pn{k) — gkn'^^ + o{n''^) as n — s- oo 



fe/3N 



(6.2.4) 
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The claim holds for k — I with 



91 ' ' — ^= =1 ' / . (6.2.5) 



Using H6. 1.9(1 and (|6.1.8|l . by induction we get, for fc > 2, 

. . . , km, 



But 



so that 



(m-l) i 



/ ■ \ fci/3 













fcm/3 

-> (m-l)!B(fci/3+l,...fc™/3+l) 



r„(fc)-o(n'='5)+n'^-'^(m-l)!^* f ^ ) gfc, • • • 5fc„.B(fci/3 + 1, . . . , fc„/3 + 1). 

Using Theorem imi (the ATT), with w = fc/3 > 1, we get 

^„(fc)=o(n^-/^)+n^/'^!|^^^V^ ^ ^ ).g,,....9,„B(M+l,...,fc™/3+l). 

r(/c/3+m) k \ 1' ■ • ■ ' "I' 

Thus, defining 5^ recursively as 

9^ = T^^M±^^*{k. \ \k^---9u^B{k,p+l^...,k„,p+ll (6.2.6) 

we see that (|6.2.4|) holds for all A; > 1. 

By Lemma [6. 2 .31 fto follow) and the method of moments (cf., e.g., 0, Sections 4.4 and 4.5]), 
the gkS are the moment of a uniquely determined distribution, say C{Y), and 

with convergence of all moments. It remains to show that Y — Yp. 
Define 

m 

where {Yj)JLi are independent copies of Y and (6*1, . . . , Sm) is uniformly distributed on the 

(to - l)-simplex, independent of (y,-) ^i. W e wiU show that F = F, and then, by if^TTIl . £(?) 

satisfies the distributional identity (|6. 2.1(1 and has finite second moment. By Lemma 16.2.11 
^ c 

Y — Yf3, as desired. 

To show F = y, it suffices to show that Y and F have the same moments. Letting 
denote (as before) the sum over (to + l)-tuples (fci, . . . , km+i) of nonnegative integers summing 
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to fc, and using ^TT^, if^T^ . and (jO?^ . 



E y'= = ^ 

k 

= E 



fci J ■ ■ ■ J kjYi+i 



E 



fcl , . . . , fcrn+l 



(m - l)!B(fci/3 + 1, . . . fc™/3 + l)gk, ■ ■ ■ gk„ 



ki^ . . . ^k. 



m+l 



B{kiP + 1, . . . kjnP + l)gki ■■■9k„ 



m!B(fc/3+l,l,...,l)5fe 



^ _ m!r(fc/3+ 1) 
~ r(fc/3 + to) 



fffc 



m!r(fc/3+ 1) 
r(fc/3 + to) 



5fc = 5fc = E 



where (as before) denotes the same sum as with the additional restriction that ki < k 
for i = 1, . . . , to. □ 

Lemma 6.2.3. The moments (gk) uniquely determine the distribution C{Y). 

Proof. Define '■= gk/k\. It suffices to show that there exists an M such that 7^ < for all 
k sufficiently large. We proceed by induction. Indeed, by H().2.6|l we know 



Ik 



(to - 1)! 

m\T{k(3+ 



< M 



r(fc/3+m 



k ("^-1)! 



1 / ™ \ 

TyE*! 1 n^'^-. BikiP + l,...,kmP + l) 



1 



m!r(fcg+ 



S(fci/3 + l,...,fc™/3 + l) 



r(/c/3+m) k 

by the induction hypothesis. So it is certainly sufficient to show that 



B(fci/3 + l,...,fc™/3+l) 



E pr((fc-A:™+i)/3 + TO)-i 



E nr(M + i) 



0<fcl,...,fcm<fc j = l 

^iH h/C?Tl— ^TTl+l 



— > as fc ^ cx). 
For this, fix a value of fc„i+i G {0, 1,2,.. .}, and consider the sum 



(6.2.8) 



5^ llT{k,P+l). 



0<ki,...,km<k j^l 

/^^ -I ^/j;^^ — 



(6.2.9) 



By log -convexity of F 6.4.1], taking / to be (0,oo) and g to be InF in Proposition 3.C.1 
of [53, the logarithm of the product in (|6.2.9|l is Schur-convex on (0, 00)™. Thus applying 
Proposition 5.C.2 of |H3 with m — there, the biggest terms in the sum correspond to kj = 
k — kra+i for j equal to some jo and kj — otherwise; together, these to terms contribute 
TOF((fc — km+i)P + 1) to the sum. If fc > k,n+i, there are other terms, the biggest of which 
corresponds to having one of the fcj's be fc — km+i — 1, one be 1, and the rest be 0. (This follows 
from Proposition 5.C.1 of with to = and M = k — fcm+i — 1 there.) The total number of 
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terms in the sum (|6.2.9|) is at most (™ ^^m-i'"^^'')- remaining contribution to Ht).2.9|) is 

at most 

''^-^ + ^''- r((fc - fc„+i - + i)r(/5 + 1). 

m — 1 / 

We have found that the left side of (|6.2.8|l is bounded by 

—, j-l(fcm+l < k) f{k - k,n+l), 

f^m + l—'J 

where 

TO (™-it'^)r((fc- l)/3 + l) 

^ (-ir)r(fc/3 + i) _ 1 (fc + i)...(fc + (TO-i)) 

r(fc/3 + m) (m-1)! (fc/3+l)---(fc/3+(m-l))' - ^ 

which is a bounded function of fc. To apply the dominated convergence theorem, it suffices 
to show that the right side of (|6.2.10|l tends to as fc — > oo, which follows from Stirling's 
approximation and the fact that /3 > 0. □ 

When t„ satisfies the conditions in Theorem 16.2.21 but a > 1 + P then [compare H6.1.24() ]. 



and typically this leads to periodicity. 

It is known that a < 3/2 for m < 26. Computations show that a increases with m, at least 
for TO < 10000 51] . This suggests that for a fixed (3, the condition a < 1 + (3 (resp. a > 1 + /3) 
is equivalent to m < mg (resp. m > toq) for some mo > 26. 

Large toll functions 

If tn ^ , where /? > 1, then we have convergence in distribution for all values of m. We state 
the result, omitting the proof, as it is very similar to that of Theorem 16 .2. 21 

Theorem 6.2.4. If the real toll sequence (t„) satisfies 

tn ^n'^, P> 1, 

then 

Xn C 

with convergence of all moments, where C{Yfj) is the unique distribution satisfying (|6.2.1|l . 

The borderline case i„ ~ n [more specifically, i„ = n — (to — 1)] corresponds to the well- 
studied total path length of a random TO-ary search tree. The corresponding additive functional 
measures the number of key comparisons in m-ary Quicksort. As is well known [6^ . the number 
of key comparisons has mean Q{nhin) and standard deviation G(n). See |36l[ for details in this 
case. 
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Chapter 7 
Catalan model: the toll function 

We will now use the extended singularity analysis toolkit from Chapter ^ to derive limiting 
distributions for the additive functionals on binary search trees under the uniform model for the 
class of toll functions of the form n" where a > 0. 



7.1 Preliminaries 

In the sequel the notation [• • •] is used both for Iverson's convention [isl see 1.2.3(16)] and the 
coefficient of certain terms in the succeeding expression. The interpretation will be clear from 
the context. For example, [a > 0] has the value 1 when a > and the value otherwise. In 
contrast, [2;"]F(z) denotes the coefficient of z" in the series expansion of F{z). For notational 
convenience we define L{z) := ln((l — z)~^). Unless otherwise noted, henceforth all our singular 
expansions are valid as z — *■ 1 in the sector — tt + e < arg(l — z) < tt — e, for any e > 0. 

We will make extensive use of the following consequences of the singular expansion of the 
generalized polylogarithm. Note that neither this lemma nor the ones following make any claims 
about uniformity in a or r. 

Lemma 7.1.1. For real a < 1 and r a nonnegative integer, 

r 

Li„,,(z) = ^ Ai"''^)(l - zr-^U-\z) + 0(|1 - zr-) + {-lYC^^\a)[a > 0], 

fc=0 

where A^"''^^ = (JJ)r^'^''(l — a) and e > is arbitrarily small. 
Proof. By Theorem 14.1.31 

Li„,o(^)'-^(l-«)^-l+^tl^C(«-J)^^ i = -lnz = f;^i^, (7.1.1) 

and for r a positive integer, 



i>o 1=1 



gr 

Lia,r(z) = {-iy-^Liafl{z). 

Moreover, the singular expansion for Lio, is obtained by performing the indicated differentiation 
of H7.1.1I) term-by-term. To establish the claim we set / = r(l — a) and g = t"~^ in the general 
formula for the rth derivative of a product: 



k=0 

to first obtain 



^ k=0 
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The claim then follows easily. 



□ 



The following "inverse" of Lemma lT.l.ll is very useful for computing with Hadamard prod- 



ucts. 



Lemma 7.1.2. For real a < 1 and r a nonnegative integer, 

r 

(1 - zr-^L-{z) = -"i"'''^ Li„.r-fe(^) + 0(|1 - z^-') + cr{a)[a > 0], 



k=0 



where /iQ"''' = l/r(l — a), Cr{a) is a constant, and e > is arbitrarily small. 
Proof. We use induction on r. For r = we have 

Li„^o(2) = r(l - a)(l - z)"-i + 0(|1 - z\"-') + Cia)[a > 0] 
and the claim is verified with 



(a,0) 

Mo 



1 



and Co (a) 



C(a) 



r(i -a) ' ' r(i - a) 

Let r > 1. Then using Lemma l7 . 1 . II and the induction hypothesis we get 

Lia,r(z) = r(l - - z)"-^L'^{z) 

r ["r — k 



(a.r) 



^Mr''^" Li„,,_fe_,(z) + 0(|1 - zl"-^) + cr-k{a)[a > 0] 



+ 0(|1 - z\"-') + (-l)''C('-)(a)[a > 0] 



r—k 



= r(l - a)(l - zr-'L^iz) + ^ Ai"'^) Yl ^1-1=? Li„,,(z) + 0(|1 - zl"-) 

fe=l s=0 

+ (T. >^k'''^^r-kia) + (-l)'-CW(a) j [a > 0] 

r-l 

= r(l - a)(l - zr-^L'^iz) + J2 ^^i"^"' Li„,3(z) + 0(|1 - zl"-^) + 7r(a)[a > 0], 



s=0 



where, for < s < r — 1, 



and where 



E, (Q,r) (Q,r-fe) 



fc=l 



V(a) :=^Ar''^)c._fe(a) + (-ircW(a). 



Setting 



and 



the result follows. 



(a.r) 
Mo = 



1 



r(i-a)' 



(a.r) 



(Q,r) 



Cr(Q;) 



r(l-a) 
7r(a) 



, l<k<r, 



r(l-a)' 



□ 
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For the calculation of the mean, the following refinement of a special case of Lemma IV. 1. H is 
required. It is a simple consequence of Theorem 14.1.31 

Lemma 7.1.3. When a < 0, 

Li„,o(z) = r(l - a)(l - - r(l - - zT + 0(|1 - 2!"+^) + > -!]■ 

7.2 Asymptotics of the mean 

We start by reviewing some notation. Let /3„ denote the nth Catalan number 

1 /2n^ 



n + 1\ n / ' 

with generating function 



00 

CAT(z) V /3„z" = -(1 - Vl^)- 



2z 

In our subsequent analysis we will make use of the identity 

z CAT2(z) = CAT(z) - 1. (7.2.1) 

The mean of the cost function a„ := EX„ can be obtained recursively by conditioning on the 
key stored at the root as 

(aj_i + a„_j) + i„, n > 1. 

j=i 

This recurrence can be rewritten as 

n 

2j2i0J~i(^J-i)l3n~j + {l3ntn), U > 1. (7.2.2) 

J = l 

Multiplying H7.2.2f) by z"/4" and summing over n > 1 we get 

Aiz) CAT(./4) = nz)^Cmz/^) ^ ao^^ ^^^^ 3^ 

VI — z Vl — 2; 

where A{z) and T{z) are the ordinary generating functions of (an) and (tn) respectively. In our 
analysis we will assume ao = to so that the second term in (|7.2.3|1 does not contribute. (Our 
results can be easily adjusted when this is not the case.) We will use (|7.2.3|l (with aq = to) as 
the starting point for our analysis. 

Since t„ = n", by definition T = Li_„.o. Thus, by Lemma l7.1.3[ 

T{z) = r(l + a){l- z)-"-i - r(l + a)^^(l - ^)"" + 0(|1 - z\-"+') + C(-«)[a < !]• 

We will now use 1)7.2.3(1 to obtain the asymptotics of the mean. 

First we treat the case a < 1/2. Since CAT(z/4) = 2 + 0(|1 - z|i/2) as z ^ 1 in a suitable 
indented crown, we have, by Theorem I4.3.5f b). 

T{z) CAT(z/4) = Co + 0(|1 - ^1""+^), 
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where 



•o:=r(z)0CAT(z/4)^^^ = ^n"^. 



n=l 



We now aheady know the constant term in the singular expansion of T{z) CAT(z/4) &t z = 1 
and henceforth we need only compute lower-order terms. The constant c is used in the sequel 
to denote an unspecified (possibly 0) constant, possibly different at each appearance. 
Let us write T{z) = Li{z) + Ri{z), and CAT(z/4) = L2{z) + R2{z), where 

r(l + a)(l - - r(l + a)^(l - z)-'^ + C(-a), 

i?i(z) :=r(z)-ii(z) = 0(|l-z|i-"), 
L2{z) :=2(l-(l-z)i/2), 
i?2(2) := CAT(z/4) - L2{z) = 0(|1 - z\). 

We will analyze each of the four Hadamard products separately. First, 

Li(z)0L2W = -2r(l + a)[(l-z)-"-i0(l-z)i/2]+2r(l + a)^^[(l-z)-"0(l-0)i/2]+c. 



By Theorem 11331 



(l-z)-"-i0(l-z)i/2 = c + 



r(a-i) 



r(a + i)r(-i/2) 



(l_^)-+2+0(|l-^|), 



and 



(l-z)^"0(l-z)i/2^c+O(|l-z|) 
by another application of Theorem 14. 3. Sf b). this time with fc = 1. Hence 

Li(z) L2{z) - [Li(z) L2(z)] + - + 0(|1 - z| 

The other three Hadamard products are easily handled as 

Li(z) i?2(z) = R2{z)\ + 0(|1 - zr"+^), 

2:— 1 

L2{Z) i?i(z) = [L2(;2) Rliz)] + 0(|1 - z|), 

2—1 

Ri{z) ii-aW = [^iW Mz)] + - z|). 
Putting everything together, we get 



T{z) CAT(z/4) = Ca + 
Using this in H7.2.3(l . we get 



- z)-"+5 + 0(|1 - zr"+i) 



A{z) CAT(z/4) = Co(l - + £i^^ 



(l-z)-" + 0(|l-zr"+2). 



(7.2.4) 
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To treat the case a > 1/2 we make use of the estimate 

(1 - ^f'^ - j;p^[Li3/2,o(^) - C(3/2)] + 0(|1 - z|), (7.2.5) 
a consequence of Theorem 14 .1.31 so that 

T{z) (1 - z)i/2 = Li_„,o(z) (1 - zfl^ = ^^y^ Li3_^^g(z) + R{z), 

where 

{c + 0(|l-z|i-") l/2<a<l 
0{\L{z)\) (7.2.6) 
0(|l-z|i-") a>l. 

Hence 

r(z) CAT(z/4) = -^^y2) ^^f-a.o^^) + 
where i?, hke i?, satisfies (|7.2.6|l (with a possibly different c). When a — 1/2, this gives us 
T(z) CAT(z/4) = -^^l^^L[z) + c+ 0(|1 - zj^/^), 

so that 

A{z) CAT(z/4) = ^(1 - z)^i/'L(z) + c(l - z)-!/^ + ^(i), (7 2.7) 

For a > 1/2 another singular expansion leads to the conclusion that 

A{z) CAT(z/4) = ll(l - z)-" + i?(z), (7.2.8) 

where 

{0(|l-zr5) l/2<a<l 
0{\\ ^ z\-\\L{z)\) a^l 
0(|l-zr"+5) a>l. 
We defer deriving the asymptotics of a„ until Section mi 

7.3 Higher moments 

We will analyze separately the cases < a < 1/2, a — 1/2, and a > 1/2. The reason for this 
will become evident soon; though the technique used to derive the asymptotics is induction in 
each case, the induction hypothesis is different for each of these cases. 

Small toll functions (0 < a < 1/2) 

We start by restricting ourselves to tolls of the form n" where 0<a<l/2. In this case we 
observe that by singularity analysis applied to H7.2.4|) . 

4" Vtt Vtt 



70 



7.3. Higher moments 



a„ = 7ii[l + 0{72-')][Con-^ + 0(ji"-i)] = Cqti + 0(n"+5) = (Co + o{l)){n + 1). 

The lead order of the mean a„ = E Xn does not depend on a. We perform an approximate 
centering to^get to the dependence on a. ^ 

Define X„ := X„ - Co{n + 1), with Xq := 0; /i„(fc) EX,^, with /i„(0) = 1 for all n > 0; 
and flnik) := Pnfin{k)/A^^ . Let M]~{z) denote the ordinary generating function of in the 

argument n. 

By an argument similar to the one that led to H7.2.2|) . we get, for /c > 2, 

Mk) = -'^^;-^ilj-l(k) +fn{k), 71 >1, 



where 



1 / k \ 

fel,fc2<fe 



1 / \ " 

i E (fc, fc, fcJ*»'E/^j-i(^i)A»-j(^2), 



/ci ,k2<k 

for n > 1 and fo(fc) := fJ.o{k) — fj,o{k) = {—1)'^Cq. Let Rk{z) denote the ordinary generating 
function of f„(fc) in the argument n. Then, mimicking (|7.2.3|l . 

M,{z) = (7.3.1) 



with 



R,iz) = i-irc^ + J2 ik .\)(TW®'^)0[jM,,(z)Mfc,(z)], (7.3.2) 

ki,k2<k 

where for k a nonnegative integer 

T{z)®^ :=T{z)Q---QT{z). 

fe 

Note that Mo(z) = CAT(z/4). 

We analyze separately the cases < a < 1/4 and 1/4 < a < 1/2. The proof technique in 
either case is induction, though the latter case is slightly simpler. 

Proposition 7.3.1. In the notation introduced above, when < a < 1/4, for k > 1, 

Mkiz) = Cfe(l - z)-^^°'+h + ^ + 0(|1 - 2|-M"+5) + ^ + (2a-.))^ 

as z ^ 1 in the sector — tt + e' < arg(l — z) < n — e' , where e > and e' > are arbitrarily 
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small. The Ck 's here are defined by the recurrence 

1 ^ r(fca + |-i) r(a-i) , 



Proof. For A: = 1 the claim is true as shown in (|7.2.4|l with Ci as defined in H7.3.3|l . We will now 
analyze each term in (|7.3.2|l for k > 2. For notational convenience, define a' :— a + Also, 
observe that 

r(z)0'= = Li_fc„,o(^) = r(i + fca)(i - z)-'^"-! + o(|i - zj-'^"-^) 

by Lemma 17.1.11 

For this paragraph, consider the case that ki and /c2 are both nonzero. It follows from the 
induction hypothesis that 

|a4,(z)M,,(z) = 1(1 - (1 - z))[Cfc,(l - + 0(|1 _ 

X [Ck,{l - z)-^'°''+^ + Oi\l - z\-''-'°''+^+^^"'-''>)] 

If ^3 = then the corresponding contribution to Rk{z) is 

iQJc,,Cfe,(l - + l +0(|1 - ^|-W + l + (2a-.))^ 

If A:3 7^ we use Lemma [V. 1.21 to express 



The corresponding contribution to Rk{z) is then times: 



Cfci Cj 

4r((fc 



Now fca < fc — 2 so — fca' + + 2 < 1. Hence the contribution when fca 7^ is 

o(ii - zr'="'+^+i) === o(|i - zr'="'+i) ^ o(|i - ;j:|-fc"'+i+(2"-e)). 

Next we consider the case when ki is nonzero but fc2 = 0. In this case using the induction 
hypothesis we see that 

|Mfe, WMfe(z) = CAT(z/4)Mfe,(z) 

= i— ^i^-^^[Cfe,(l-z)-'^'i"'+5] +0(|1 -zp''-i"'+^+(2"-')) 

= ^(1 - + 0(|1 - 2|-fei"'+5+(2a-^))_ 



72 



7.3. Higher moments 



Applying Lemma r7.1.2l to the last expression we get 

X C 

-MkJzWkJz) ^ — - — ^Li 3 (z) 



Cfeir, f 1 . iiC(-fcia' + i) 



+ 0(|l-zr-+2+(2"-))-l±l[fc,a'-i <1] 

The contribution to Rk{z) is hence (^^) times: 

-— %^Li .3 3j^) + Li-fe3„.oW©0(|l-^r'-^"'+^+('"^^)). 

Using the fact that a > and fca < fc — 1, we conclude that —ka' + + f < 1 so that, by 
Lemma It. 1.11 and Theorem 14 . 3 . Sf a) . the contribution is 

0(|1 - z|-'^"'+^+5) = 0(|1 - zp'^^'+i) 

where the displayed equality holds unless ki, = 1. When fcs = 1 we get a corresponding contri- 
bution to Rk{z) of (j,'!^) times: 

Ck-lT{ka' - 1) _ x-fea' + l I - ^|-fca' + l + (2Q-e)\ 

2r((fc-l)a'-i)^' ^1 ^' 

since for fc > 2 we have ka' > 1 + {2a — e). The introduction of e handles the case when ka' = 
1 + 2a, which would have otherwise, according to Theorem I4.3.5f c) introduced a logarithmic 
remainder. In either case the remainder is 0(|1 — zj^*^" +i+{2a-e)y rpj^g ^^^^ when ^2 is nonzero 
but fci = is handled similarly by exchanging the roles of fci and ^2. 

The final contribution comes from the single term where both fci and fc2 are zero. In this 
case the contribution to Rk{z) is, recalling H7.2.1|l . 

Li_fc„,o(^) [\ CAT2(z/4)] - Li_fc,,oW (CAT(z/4) - 1) = Li.fe^.oW CAT(z/4). (7.3.4) 
Now, using Theorem 14. 1.31 

CAT(z/4) = 2 - 2(1 - z)i/2 + 0(|i - z|) = 2 + 2j^^ - j;^^ Li3/2,o(^) + 0(|1 - z|), 
so that 17.3.41) is 



{0 1 - fca < 0, 

o(ii-zn i-fc-a = o, 
0(1) 1 - fc-a > 0. 

When I — fca < 1 this is 0(|1 - zp'^"+2); when | — fca > 1, it is 0(1). In cither case we get a 
contribution which is 0(|1 - z|-fe"'+i+(2a-£))^ 
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Hence 



Rk{z) - 0(1) + 



.ki+k2=k 



+ 2fc 



2 r((fc-l)a+f-l 



(1 - 



+ 0(|l-^|-'="'+i+(2"-^)) 
= Cfc(l - z)-'^"'+i + 0(|1 - 2r'^'"'+i+(2"-^)), 

with the Cfe's defined by the recurrence H7.3.3|l . Now using (|7.3.1(l . the claim follows. □ 

Next, we handle the case when 1/4 < a < 1/2. 
Proposition 7.3.2. In the notation introduced above Proposition \7.cl.l\ when 1/4 < a < 1/2, 

Mk{z) ^ Cfe(l - + 0(|1 - zr'=("+^)+i), 

for k > I, where the C'k 's are defined by the recurrence 

Proof. For k = 1 the claim is true as shown in Section 17.21 with Ci defined at H7.3.3|l . We 
will now analyze each term in (|7.3.2|l for fc > 2 in a manner similar to that in the proof of 
Proposition 17.3.11 

First we consider the case that ki and ^2 are both nonzero. It follows from the induction 
hypothesis that 

If ^3 = then the corresponding contribution to Rk{z) is 

^ ^ ^ ^ Cfc,Cfc,(l - z)-'^-"'+i + o(|i - zp'^^'+i). 



4 Vfci 

If ^3 ^ we use Lemma r7.1.2l to express 
z ^ ^ ^ 

-M,Az)MM - 4r((fc, + 4;._i) Li-(..+>..K+2,o(^) 

4 1 ((fci + /e2jQ: - Ij 

The contribution to Rk{z) is then (^.^ times: 

-— %^^V^Li ,.3^^ fz)+Li_,3„,o(z)0O(|l-zr('=^+'=^)"'+i). 

41 [{ki + K2)a — 1) -ka' + — +2,0 

Now fc3 < fc — 2 so — fca' + + 2 < 1. Hence the contribution when fc3 7^ is 

o(|i - zp'-'^'+^+i) = o(|i - zp'^^'+i) 

by another application of Theorem 14 . 3 . 51" a) . 

Next we consider the case when fci is nonzero but k2 — Q. In this case using the induction 
hypothesis we see that 

-^MMM^M - CAT(z/4)M,,(z) = ^(1 - zy'^^'^'+h + 0(|1 - zj-'^-'+i). 
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Applying Lemma r7.1.2l to the last expression we get 

X C 

-MkJzWkJz) ^ — - — ^Li 3 (z) 

+ 0(|1 - zl-'^-'+i) - %[fcia' - i < 

2 1 [kia' — 2) 

The contribution to Rk{z) is hence (^J times: 



..T.,,^'; 1, Li iz) + Li_,3„,o(z) 0(|1 - zr^^"'+i) 

21 (fcia' — j) -fea'+— + 2-0 



Using the fact that a > 1/4 and /cs < fc — 1, we conclude that —ka' + + | < ^ so that the 
contribution is 

o(|i - zr''"'+^+^) = o{\\ - zp'^^'+i) 

by Theorem 14 . H . 5r a) . unless /C3 = 1. When kj, = 1 we get a contribution of 

A:-iy2r((fc-l)a'-i)^ ^ ^ VI I 

since k > 2 and a > 1/4 imply ka' > 3/2. The case when fc2 is nonzero but fci = is handled 
similarly. 

The final contribution comes from the single term where both ki and fc2 are zero. In this 
case the contribution to Rk{z) is 

Li_fc„,o(z) CAT2(z/4)] = Li_fe„,o(z) (CAT(z/4) - 1) = Li_fc,,o(z) CAT(z/4) 

/ 3 

which is easily seen to be 0(|1 — z| " 2). 

Hence, as in the proof of Proposition 17.3.11 we see that 

Rk{z) = Ck{l - + 0(|1 - zr'=("+5)+l) 

with the Cfe's defined by the recurrence H7.3.3|) . and the claim follows using (|7.3.1|l . □ 

Large toll functions (a > 1/2) 

When a > 1/2 there is no need to apply the centering techinques. Define fJLn{k) :— EAT^ 
and p,n(k) := PnlJ"n{k)/^^. Let Mk{z) denote the ordinary generating function of fin{k) in n. 
Observe that Mo{z) = CAT(z/4). As earlier, conditioning on the key stored at the root, we 
get, for fc > 2, 

fin{k) = -^^^^flj-lik) +fn{k), n>l, 
J = l 

where 

1 / A: \ " 

kl+k2 + k3=k ^ j=l 

ki,k2<k 
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for n > 1 and ro(fc) := fJ-o{k) — iio{k) — 0. Let Rk{z) denote the ordinary generating function 
of fn{k) in n. Then 

— Rk{z) 

Mk{z) 

s/l- z 

and 

Mz)= J2 (fc .\)(n^)°'^)0[jM,,(z)M,,(z)]. (7.3.5) 

ki.k2<k 

We can now state the result about the asymptotics of the generating function Mk when 
a > 1/2. The case a = 1/2 will be handled subsequently, in Proposition 17.3.41 

Proposition 7.3.3. Let e > be arbitrary, and define 

a— ^ ^<Q!<1 

-e a^l (7.3.6) 
a > 1. 

Then the generating function Mk{z) of p,n{k) satisfies 

Mk{z) = Ck{l- z)-'^("+5)+5 + 0(|1 - zr'^("+5)+^+=) 
for k>\, where the Ck 's are defined by the recurrence (|7.3.3|l . 

Proof. The proof is very similar to those of Propositions 17.3.11 and 17.3.21 We present a sketch. 
The reader is invited to compare the cases enumerated below to those in the earlier proofs. 

When k — 1 the claim is true by (|7.2.8|l . We analyze the various terms in H7.3.5|) for fc > 2, 
employing the notational convenience a' := a + 

When both ki and k2 are nonzero then the contribution to Rk{z) is 



i (fci) ^^^^^^(1 - + 0(|1 - z|-'="'+^+i) 



when fca = and is 0(|1 — z\ +'=+i) otherwise. 

When ki is nonzero and k2 = the contribution to Rk{z) is 

^_Cfc^r(fc«^_-2) _ + o(|i _ ^|-fea'+c+i) 

2r((/c- l)a' - i)^ ^ VI I 7 

when fca = 1 and 0(|1 — zj"''" otherwise. The case when fc2 is nonzero and fci = is 

identical. 

The final contribution comes from the single term when both ki and fc2 are zero. In this 

case we get a contribution of 0(|1 - z\~''"~^2) which is 0(|1 - z|"'="'+'=+i). Adding aU these 
contributions yields the desired result. □ 

The result when a = 1/2 is as follows. Recall that L{z) :— ln((l — z)~^). 

Proposition 7.3.4. Let a — 1/2. In the notation of Proposition \7^y.y\ 

_ 1 

Mfc(z) = (1 - z)-^+2 ^ Ck,iL'^-\z) + 0(|1 - zj-'^+i-^) 



1=0 
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for k > 1 and any e > 0, where the Ck.i 's are constants. The constant multiplying the lead-order 
term is given by 

(2fc^2)! 

^fc^O - 22ft-2(fc_l),^fc/2- (7.3.7) 

Proof. For k = 1 the claim is true by (|7.2.7|) . For fc > 2 we estimate each term in (17.3. 5II . 

For this paragraph, suppose that ki and k2 are both nonzero. Then using the induction 
hypothesis we get 

1=0 m=0 

+ 0(|l-zr('=^+'=^)+i-2^). (7.3.8) 

If /ca = the corresponding contribution to Rk{z) is 

^ ^ ^ Vl - z)-'=+i ^ Au,L^-\z) + 0{\l- zr'^+i-2^), 



4Vfci 



i=0 



where Ak^ = CkiflCk-ki.o and (^fc.i)f=i can be determined easily from H7. 3.8(1 . If fcs 7^ then 
we use Lemma [7.1.21 once again to get 

^MkAzWkA^) = E ^x+fe.,;Li-(fc,+fc.)+2,fc,+fe.-/(2) + 0(|l-2r''^+'^^+^-'^), 

where (^fei+fe2.0f=J'^^ can be determined using H7.3.8|) and Lemma [V. 1.21 The corresponding 
contribution to Rk{z) is then [^^ times: 

ki+k2 ^ g 

E ^^k.^,...,k.^^ + Li_^^^(z) 0(11 - .r(^^+'=^)+.--). 

Invoking Lemma lV.l.ll we see that the contribution when fcs 7^ is 0(|1 — 2;|~'^~''"2"+-'^~'^)^ which 
is Odl-zj-'^+i-'). 

Next suppose that k\ is nonzero but fc2 = 0. In this case using the induction hypothesis and 
we see that 

z- 



By Lemma f7. 1 . 21 this is 

E^^^^'Li_^.4...-.W + 0(|l-^l"'^^"^"^)' 
1=0 ^ 

where (Bkx.i)\=o constants. The corresponding contribution to Rk(z) is then (^J times: 

fci 



1=0 
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Whcnfcg > 2 the contribution is 0(|l-zr''+ '2 +2"'), which is 0(|l-zp'=+2^'). When /cg = 1 
the contribution to Rk{z) is 

fe-l _ 3 

(1 - ^ Bk-i,iL'^-'-\z) + 0(|1 - zr'+2-^), 



1=0 

for some suitably defined constants Bkj- Note that there is no lead-order contribution from this 
term. The terms where fci = and /c2 is nonzero are similarly handled. 

_fc 1 

The final contribution from the single term with fci=fc2 = 0isO(|l — z| 2 + 2)^ which is 
3 



0(|1 -zr''+2) for k>2. 

Summing all the contributions we get 



Rkiz) = (1 - J2 Ck,iL''-\z) + 0(|1 



z\-'+^^-% 



where, for fc > 2, 



ki=i ^ ^ 
fc-i 



Cm = ^ 51 U )^'^^'+2Bfc_i,i, 1>1. 

ki = l ^ ^ 



The recurrence for C^.o can be easily solved to yield 

1/2- 



The result follows immediately. 



1-1 



(2fc- 2)! 



22fe-2(fc- l)!7rfc/2' 



□ 



7.4 Asymptotics of moments 



For < a < 1/2, we have seen in Propositions I7.3.H and [7.3.21 that the generating function 
Mfe(z) of /x„(fc) — /3„/2„(fc)/4" satisfies 

where c := min{2a — e, 1/2}. By Theorem[ 



^fe(a+i)-| 



Recall that 



so that 



(3n = 



4' 



0{n 



fc(Q+^)-|- 



,3/2 



r(fc(a + i)-i) 



(7.4.1) 
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For a > 1/2 a similar analysis using Proposition 17. 3 .1^ yields 



r(fc(a 



(7.4.2) 



with now c as defined at H7.3.6I) . Finally, when a = 1/2 the asymptotics of the moments are 
given by 



finik) = (^-^^ (nln7i)'= + 0(n'=(lnn)'=-i). 



7.5 The limiting distributions 



(7.4.3) 



In Section 17.5.11 we will use our moment estimates H7.4.1f) and 17.4.2|) with the method of 
moments to derive limiting distributions for our additive functionals. The case a = 1/2 requires 
a somewhat delicate analysis, which we will present separately in Section 17.5.21 

7.5.1 a ^1/2 

We first handle the case < a < 1/2. (We assume this restriction until just before Proposi- 
tion [732I) We have 



/i„(l) = EX„ = E [X„ - Coin + 1)] = ^^n"+^ + 0(n"+5--) 

r(a) 



(7.5.1) 



with c :— min{2a — e, 1/2} and 
So 

where 



r(2a+ i) 



VarX„ = VarX„ - /i„(2) - [/i„(l)]^ = a 



2 _ ^2^2a+l 



+ 0(n2"+i-^). 



r(2a + i) r2(a)- 



We also have, for fc > 1, 



E 



An(fc) 



C^.^/7^ 



Ma+h r(/c(a+i)-i) 



T^^ + 0(n-^). 



(7.5.2) 
(7.5.3) 

(7.5.4) 



The following lemma provides a sufficient bound on the moments facilitating the use of the 
method of moments. 

Lemma 7.5.1. Define a' a + i. There exists a constant ^ < oo depending only on a such 
that 



Ck 



for all k > 1. 



Proof. The proof is by induction. For the basis case fc = 1, we simply choose A large enough 
so that the claim is true. For the induction step, assume k > 2. Define Sk '■= Ck/kl. Divid- 
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ing (|7.3.3|l by fc! we get 



Sfc 



fc-i 



SjSk-j + Sk-l 



r(fca'- 1) 
T{ka' -1-a)' 



(7.5.5) 



Using Stirling's approximation we can find a constant 7 < oo depending only on a so that, for 
all fc > 2, 

r{ka' - 1) 



T{ka' -1-a) 



Using this in H7.5.5(l we get 



\Sk 



^ fe-i 



By the induction hypothesis 



sk\ < ^Y.^fik - j)''-T' + lA^-\k - l)"'(^-i)fc° 



Now x^{k — x)^^^ decreases as x increases for < a: < fc/2 so that we can bound the sum above 
by the sum of two times the j — 1 term and k times the j = 2 term. This and trivial bounds 
yield 



\Sk\ < 



j^k 



(fc-l)'^~i + -x4(fc-2)'=-2 



j^k 



fe-i 



2fc 



j^kl^a'k-\ 
A 



'-—{ik^-^Y'+A^-^K 



k T > a'fc-i 



A 



3" ,71 



A^k"" < A^k" 



the last inequality justified for all fc > 2 by choosing A large enough. This proves the claim. □ 
It follows from Lemma 17.5.11 and Stirling's approximation that 

Cfcv^ 



fc!r(fc(a + i) - i) 



(7.5.6) 



for large enough B depending only on a. Using standard arguments 3, Theorem 30.1] it follows 
that Xn suitably normalized has a limiting distribution that is characterized by its moments. 
Before we state the result, we observe that the argument presented above can be adapted with 
minor modifications to treat the case a > 1/2, with X„ replaced by X„. We can now state a 

result for a 7^ 1/2. We will use the notation to denote convergence in law (or distribution). 
Proposition 7.5.2. Let X„ denote the additive functional on Catalan trees induced by the toll 
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sequence (n")„>o. Define the random variable Yn as follows 

X„-Co(n + l) 



< a < 1/2, 



Y ^ 

a > 1/2, 



where 



^ aPn . 1 /2n\ 



ri=0 

Then 

Y Y- 

here Y is a random variable with the unique distribution whose moments are 

r(fc(a + - i) 

where the Ck 's satisfy the recurrence 

l^/fc\ r(fca+|-l) r(Q!-i) 
Cfe = -V C,Cfe_, +fc ^ Cfe_i,fc>2; Ci = -^-^. 



The case a = 1/2 is handled in Section [7.5.21 leading to Proposition I7.5.ItI and a unified 
result for all cases is stated as Theorem l7.5.7l 

Remark 7.5.3. We now consider some properties of the limiting random variable Y = ^(0;) 
defined by its moments at H7.5.7|) for a ^ 1/2. 

(a) When a = 1 setting flk := Cfe/2 we see immediately that 

r((3A:-l)/2) 

where 

'^-1 1 
2nk.^^i ,1 + fc(3fc ~ 4)f]fe_i, f^i = -. 

Thus Y has the ubiquitous Airy distribution and we have recovered the limiting distribution 
of path length in Catalan trees [toI - It^ . The Airy distribution arises in many contexts 
including parking allocations, hashing tables, trees, discrete random walks, mergesorting, 
etc. — see, e.g., the introduction of |30j which contains numerous references to the Airy 
distribution. 

(b) When a = 2, setting 77 := y/V2 and aoA := 2^'^^Ci, we see that 

p ; _ y/^ 

- 2(5'-2)/2r((5Z-l)/2)"°''' 

where 

aoA = ( J"Ojao,i-i + 1(^1 - 4)(5^ - 6), ao,i = 1. 

j=i V-?/ 
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We have thus recovered the recurrence for the moments of the distribution £{r]), which arises 
in the study of the Wiener index of Catalan trees [ZH proof of Theorem 3.3 in Section 5]. 

(c) Consider the variance defined at (|7.5.3|l . 

(i) Figure 17.5.11 plotted using Mathematica, suggests that tr^ is positive for all a > 0. 
We will prove this fact in Theorem 17.5.71 There is also numerical evidence that 




is unimodal with max^ a'^{a) = 0.198946 achieved at a = 0.682607. (Here = denotes 
approximate equality.) 

(ii) As a ^ cxD, using Stirling's approximation one can show that ^ (v^^ l)a^^. 

(iii) As a i 0, using a Laurent series expansion of T{a) we see that ^ 4(1 — ln2)Q!. 

(iv) Though the random variable Y{a) has been defined only for a ^ 1/2, the variance 
has a limit at a = 1/2: 

lini a'{a)^^--. (7.5.8) 

(d) Figure 17.5.21 shows the third central moment E[y — EF]"^ as a function of a. The plot 
suggests that the third central moment is positive for each a > 0, which would also establish 
that Y(a) is not normal for any a > 0. However we do not know a proof of this positive 
skewness. [Of course, the law of Y{a) is not normal for any a > 1/2, since its support is a 
subset of [0, oo).] 

(e) When a — 0, the additive functional with toll sequence (n" = l)ri>i is n for all trees with n 
nodes. However, if one considers the random variable a~^/^y(a) as a | 0, using H7.5.7I) and 
induction one can show that a"i/2y(a) converges in distribution to the normal distribution 
with mean and variance 4(1 — In 2). 

(f) Finally, if one considers the random variable a^/^Y{a) as a — > oo, again using (|7.5.7|l and 
induction we find that a^^'^Y{a) converges in distribution to the unique distribution with 
fcth moment \/M. 

Remark 7.5.4. Let Y be the unique distribution whose fcth moment is a/M for k = 1,2,.... 
Taking Y* to be an independent copy of Y and defining X := YY* , we see immediately that X 
is Exponential with unit mean. It follows by taking logarithms that the distribution of In Y is 



82 



7.5. The limiting distributions 




a convolution square root of the distribution of InX. In particular, the characteristic function 
(f> of Iny has square equal to r(l + it) at t G {—oo, oo). By exponential decay of r(l + it) as 
t ±00 and standard theory (see, e.g., Chapter XV]), InF has an infinitely smooth density 
on (—00, 00), and the density and each of its derivatives are bounded. 

So Y has an infinitely smooth density on (0, 00). By change of variables, the density fy of 
Y satisfies 

r I . /in r (Iny) 

jY{y) = • 

V 

Clearly fviv) is bounded for y not near 0. (We shall drop further consideration of derivatives.) 
To determine the behavior near 0, we need to know the behavior of /in y (In y)/y as ?/ — > 0. Using 
the Fourier inversion formula, we may equivalently study 

1 f°" 

e'^hnvi-x)^— j e(i+")-0(i)dt, 

as a; ^ 00. By an application of the method of steepest descents [(7.2.11) in with — 1, 
13 — 1/2, w the identity map, zq = 0, and a ~ 0], we get 

Hence fy is bounded everywhere. 

Using the Cauchy integral formula and simple estimates, it is easy to show that 

fviy) = o{er^'y) asy->oo 

for any M < 00. Computations using the WKB method [H suggest 

/y(y) ~ (2/^)i/V/'exp(-yV2) as y ^ ex., (7.5.9) 

in agreement with numerical calculations using Mathematica. [In fact, the right-side of H7.5.9() 
appears to be a highly accurate approximation to fyiy) for all y > 1.] Figure 17.5.31 depicts 
the salient features of fy. In particular, note the steep descent of fy{y) to as | and the 
quasi-Gaussian tail. 
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7.5.2 a = 1/2 

For a — 1/2, from H7.4.3|) we see immediately that 



E 



nlnr^ 



1 



O 



1 

Inn 



Thus the random variable Xn/{n\iin) converges in distribution to the degenerate random vari- 
able I/a/tt- To get a nondegenerate distribution, we carry out an analysis similar to the one that 
led to (j7.2.7|l . getting more precise asymptotics for the mean of X„. The refinement of H7.2.7|l 
that we need is the following, whose proof we omit: 

Aiz) CAT(z/4) = ^(1 - z)-^/^L{z) + D„{1 - z)-^'^ + - z\\-'), 



where is a constant whose value we will not need in our subsequent analysis. By Theo- 
rem ^^21 this leads to 

EX„ = ^nlnn + Din + Ofn"), (7.5.10) 

where D\ is another constant whose value we will not need in our subsequent analysis. Now 
analyzing the random variable X„ — 7r~^/^nlnri in a manner similar to that of Section 17.31 we 
obtain 



- In 2 

TT 2 



Var[X„-7r"^/2nlnn] = 
Using (|7.5.10|l and (|7.5.11|l we conclude that 

Xn — TT^^/^nlnn — D\n 



0(n2+^). 



(7.5.11) 



0(1) 



and 



Var 



= -ln2 ^ lim cr^(a), 

TT 2 a^l/2 ^ ' 



(7.5.12) 



where a"^ = a'^ia) is defined at (f733|) for a 7^ 1/2. [Recall (f7TH|) of Remark 17331 ] 

It is possible to carry out a program similar to that of Section 17.31 to derive asymptotics of 
higher order moments using singularity analysis. However we choose to sidestep this arduous, 
albeit mechanical, computation. Instead we will derive the asymptotics of higher moments using 
a somewhat more direct approach akin to the one employed in '2l'|. The approach involves 
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approximation of sums by Riemann integrals. To that end, define 



X„ :=X„-7r-i/2(„+i)in(n+f)-i?i(n + l), and /t„(A:) (7.5.f3) 

Note that Xo ^ -Di, /i„(0) = /3„/4"+\ and /to(fc) = {-DiY/A. Then, in a now familiar 
manner, for n > 1 wc find 



3 = 1 



where now we define 



fci ,k2<k 



^(j In j + {n + l-j) ln(n + 1 - j) - (n + f ) ln(n + f ) + y^n^^^) 



fc3 



Passing to generating functions and then back to sequences one gets, for n > 0, 



J=0 



4i 



Using induction on fc, we can approximate r„(fc) and /i„(fc) above by integrals and obtain the 
following result. We omit the proof, leaving it as an exercise for the ambitious reader. 

Proposition 7.5.5. Let Xn be the additive functional induced by the toll sequence {n^^'^)n>i 
on Catalan trees. Define X„ as in \7.5.1!^ . Then 

E [Xn/n]'' = iT^k + o(l) as n ^ oo, 
where mo = 1, mi ~ 0, and, for k > 2, 



nik 



wher 



1 r(fc-l) 
4V^r(A:- i) 



Jki,k2.k3 



E 



ki+k2+k3=k 
ki ,k2 <k 



x''i~2 (1 - xp'^2[xliix + (1 - a;)ln(l - x)]"'' dx. 



(7.5.14) 



Furthermore X„/(n + 1) ^ 1", where Y is a random variable with the unique distribution whose 
moments are 'EY^ — rrik, k>0. 

7.5.3 A unified result 

The approach outlined in the preceding section can also be used for the case a ^ 1/2. For 
completeness, we state the result for that case here (without proof). 

Proposition 7.5.6. Let Xn be the additive functional induced by the toll sequence {n°')n>i on 
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Catalan trees. Let a' :— a + ^. Define Xn as 



Xn-Co{n+l)- ' + 0<a<l/2, 



Xri 



Via) 



(7.5.15) 



Via) 



a > 1/2, 



where 



Then, for fc = 0, 1, 2, . 



E 



Co := > -;— ■ 



Xn/n" = ruk + o(l) as n —> oo, 



where mo = I, rui = 0, and, /or k > 2, 
1 T{ka' - 1) 



TOfe 



40Fr(fca' - i; 



E 

fcl+fc2+fc3 = fe 

fci ,k2<k 



k 

fci,/C2,^3 



T{a) 



(7.5.16) 



Jk,MM-= I x'=i"'-i(l-a;)'^^"'^iK' + (l~a;)"'-l]'=^dx. 



with 



Furthermore, Xn Y, where Y is a random variable with the unique distribution whose mo- 
ments are E Y'' — ■ 

[The reader may wonder as to why we have chosen to state Proposition 17 . 5 .HI using several 
instances of n + 1, rather than n, in H7.5.15(l . The reason is that use of n + 1 is somewhat more 
natural in the calculations that establish the proposition.] 

In light of Propositions 17.5.21 17.5.51 and 17.5.61 there are a variety of ways to state a unified 
result. We state one such version here. 

Theorem 7.5.7. Let Xn denote the additive functional induced by the toll sequence (7i")n>i 
on Catalan trees. Then 

Xn ~ E Xn ^ 

where the distribution of W is described as follows: 
(a) Fora^ 1/2, 



W 



1 



Y 



T{a) 



with (7^ 



C^TT 



>o. 



r(2a+i) r2(a) 

where Y is a random variable with the unique distribution whose moments are 



EY'^ = 



r(fc(a + i)-i)^ 



and the Ck 's satisfy the recurrence i7.3.'J\ ). 
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(b) Fora^ 1/2, 

W^—, with := -\n2 , 

a TT 2 

where Y is a random variable with the unique distribution whose moments nik ~ E Y'^ are given 
by rrXTTP . 

Proof. Define 



VVarX„ 



(a) Consider first the case a < 1/2 and let a' := a + i. By l|7.5.1|) . 

EX„ = Coin + 1) + ^i^n"' + o(n"'). (7.5.17) 
r(a) 

Since X„ defined at H7.5.15|l and X„ differ by a deterministic amount, VarX„ = VarX„. Now 
by Proposition 17. 5 .51 

Var X„ ^ E - {-£ X„f = {m2 + o{l))n^"' - {mj + o{l))n^°'' = {m2 + o{l))n^°'' . (7.5.18) 
So cr^ equals m2 defined at H7.5.16|l . namely, 

1 r(2a'- 1) fTja-^)' ' 



Jo,o. 



2- 



40Fr(2a'- i) V r(a) 
Thus to show (T^ > it is enough to show that Jo, 0,2 > 0. But 

Jo,0.2 = /' X"'/'(1 - a;)-3/2[^a' ^ (1 _ „ ^^2 

which is clearly positive. Using H7.5.17() and H7.5.18() . 

X„ - Co(n + 1) - ^n"' + o(n"') 



(1 + o(l))crn"' 



so, by Proposition 17 . 5 . 2l and Slutsky's theorem 3, Theorem 25.4], the claim follows. 
The case a > 1/2 follows similarly. 

(b) When a = 1/2, 

E Xn = —i=n In n + D\n + o(n) 

by rrOn and 



Var X„ = In 2 - - + o(l) j 
by H7.5.12|) . The claim then follows easily from Proposition 17.5.51 and Slutsky's theorem. □ 
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Catalan model: the shape functional 

We now turn our attention to the shape functional for Catalan trees. The shape functional is 
the cost induced by the toll function i„ = logn, n > 1. For background and results on the shape 
functional, we refer the reader to 21] and 55]. 

In the sequel we will improve on the mean and variance estimates obtained in |2lj ] and derive 
a central limit theorem for the shape functional for Catalan trees. The technique employed is 
singularity analysis followed by the method of moments. 

8.1 Mean 

We use the notation and techniques of Section [7.21 again. Observe that now T{z) = Lio,i(z) 
and H7.2.5|l gives 

CAT(z/4) = 2 - ^,^A_[Li3/,_o(,) - C(3/2)] + 2 (^1 - ^p^) " ^) + " ^l'^')" 

So 

r(z) CAT(z/4) - -^.^A^ Li3/2,i(z) + + 5(1- z) + 0{\1 - z\l-'), 

where c and c denote unspecified (possibly 0) constants. The constant term in the singular 
expansion of T{z) CAT(z/4) is already known to be, using Theorem 14. 3. 51 

Co = T{z) CAT(z/4) _ = Y.^ogn)^. 

n— 1 

Now using Theorem 14. 1 . 31 to obtain the singular expansion of Li3/2. 1(^)7 

T{z) CAT(z/4) = Co - 2(1 - zf'^L{z) - 2(2(1 - log(2)) - 7)(1 - zf'^ + 0{\\- z|), 
so that 

A{z) CAT(z/4) = Co(l - - 2L{z) ~ 2(2(1 - log2) - 7) + 0(ll - zj^/^). (8.1.1) 

Using Theorem 14. 1 . 21 and the asymptotics of the Catalan numbers we get that the mean a„ of 
the shape functional is given by 

a„-Co(n+l)-2x/^ni/2 + Cl(l), (8.1.2) 

which agrees with the estimate in Theorem 3.1 of 21] and improves the remainder estimate. 

8.2 Second moment and variance 

We now derive the asymptotics of the approximately centered second moment and the variance 
of the shape functional. These estimates will serve as the basis for the induction to follow. We 
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8.2. Second moment and variance 



will use the notation of Section [731 centering the cost fmiction as before by Co{n + 1). 
It is clear from (I8.1.1|l that 

Mi{z) = -2L{z) - 2(2(1 - log2) - 7) + 0(|1 - z]^^"^), (8.2.1) 
and (|7.3.2ll with k = 2 gives us, recalling H7.2.1II . 

R2{z) = Cl + CAT(z/4) Lio,2(z) + 4Lio4(z) CAT(z/4)Mi(z)] + |Mf (z). (8.2.2) 

We analyze each of the terms in this sum. For the last term, observe that z/2 ^ 1/2 as z ^ 1, 
so that 

|iW?(z) = 2L\z) + 4(2(1 - log2) - ^)L{z) + 2(2(1 - log2) - 7)' + 0{\1 ~ z|5-^), 
the e introduced to avoid logarithmic remainders. The first term is easily seen to be 

CAT(z/4) Lio,2(z) = K + 0(|1 - z\\-') 

by Theorem 14. H. 51 where 

n— 1 

For the middle term, first observe that 

^ CAT(z/4)Mi(z) = -L{z) - (2(1 - log2) - 7) + (1 - zf'^L{z) + 0(|1 - z\^/^) 

and that L{z) = Lii^o(z). Thus the third term on the right in H8.2.2|l is 4 times: 

-Lii,i(z) + c + 0(|1 - z\\'^') = -\l\z) + 7L(z) + c + 0(|1 - z\\-'). 

[The singular expansion for Lii^i(z) was obtained using (|4.1.4|l . We state it here for the reader's 
convenience: 

Lii,i(z) = ]^L\z) ~ jL{z) + c+0{\l - z|), 
where c is again an unspecified constant.] Hence 

R2{z) = 8(1 - log2)L{z) + c + 0{\l - z\^-'), 

which leads to 

M2{z) = 8(1 - log2)(l - z)-^/^L{z) + c(l - z)-i/2 + 0(|i _ z\~'). (8.2.3) 

We draw the attention of the reader to the cancellation of the ostensible lead-order term L'^{z). 
This kind of cancellation will appear again in the next section when we deal with higher moments. 
Now using singularity analysis and estimates for the Catalan numbers we get 

/i„(2) = 8(l-log2)nlogn + cn + 0(n5+<^). (8.2.4) 

Using 

Var X„ = /i„(2)-/i„(l)2 = 8(l-log2)nlogn + cn + 0(n^+'), 
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which agrees with Theorem 3.1 of [21| (after a correction pointed out in 55]) and improves the 
remainder estimate. In our subsequent analysis we will not need to evaluate the unspecified 
constant c. 

8.3 Higher moments 

We now turn our attention to deriving the asymptotics of higher moments of the shape func- 
tional. The main result is as follows. 

Proposition 8.3.1. Define Xn :— Xn — Co{n+l), with Xq :=0;/i„(A;) :=EiX!^, with fln{0) = 1 
for all n > 0; and finik) '.— /3„/i„(fc)/4". Let Mk{z) denote the ordinary generating function of 
finik) in the argument n. For k > 2, Mk{z) satisfies 

_ k-i L'-^/'J k 

Mk{z) = (1 - z)-— Gk,,L^^I^^-'{z) + 0(|1 - ^1-2+1-), 

i=o 

with 

1 (2l\ 

^2i,0 = 4 5Z (^2j^2,'0C2;-2j,0, C2,0 = 8(1 - log 2). 

Proof. The proof is by induction. For fc = 2 the claim is true by l|8. 2.3(1 . We note that the claim 
is not true for fc = 1. Instead, recalling (|8.2.1|l . 

m{z) = -2L(z) - 2(2(1 - log2) - 7) + 0(|1 - z\^^'^). (8.3.1) 

For the induction step, let A: > 3. We will first get the asymptotics of Rk{z) defined at I]?. 3.2(1 
with T(z) = Lio,i(z). In order to do that we will obtain the asymptotics of each term in the 
defining sum. We remind the reader that we are only interested in the form of the asymptotic 
expansion of Rk{z) and the coefficient of the lead order term when k is even. This allows us to 
"define away" all other constants, their determination delayed to the time when the need arises. 
For this paragraph suppose that ki > 2 and k2 > 2. Then by the induction hypothesis 

1 k,+k, Lfcl/2J + Lfe2/2J 

-^MkAzWkM - 1(1 z)-^^' ^.„..,iL../2JH../2j-.(,) 

1=0 

fci+fca , 3 

+ 0(|l-zr^— + 2-^), (8.3.2) 

where Ak-^^k2,o = CkiflCk^fl- (a) If A^a = then fci + fc2 = fc and the corresponding contribution 
to Rk{z) is given by 

1 I Lfci/2j + L(fe-fci)/2j 

^""Vl- z)-2+' £ A,„,_,,,LLfc^/2J + L('=-feO/2j-^(,) + o(|l - zr 2+5-). 

1=0 

(8.3.3) 

Observe that if k is even and ki is odd the highest power of L{z) in 1(8. 3. 3() is [fc/2j — 1. In 
all other cases the the highest power of L{z) in (|8.3.3(l is [fc/2j. (b) If fca 7^ then we use 
Lemma [7. 1.21 to express ((8.3.2(1 as a linear combination of 



4 Vfci 



Lfei/2J + Lfc2/2J 
1=0 
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_ /ci+fc2 3 

with a remainder that isO(|l — zj 2 ^^2 ). When we take the Hadamard product of such 
a term with Lio,fc3(z) we will get a linear combination of 



Li ki+k2 , ^ , , , (z) 



LA;i/2j + Lfe2/2j 



2 J;^o 

and a smaller remainder. Such terms are all 0(|1 — z| 2 ^ ), which is 0(|1 — z| 2 + 2 ). 

Next, consider the case when fci = 1 and k2 > 2. Using the induction hypothesis and H8.3.1|l 
we get 



-Mfe,(z)Mfe,(z) = --(l-z)-— 5] B,,^,d'\ '(z) + 0(|l-zr-+i-2'), (8.3.4) 



with i?fc2,o — Cfc2,o- (a-) If fcs = then k2 = k — 1 and the corresponding contribution to Rk{z) 
is given by 

, , L(fe-1)/2J+1 I fc_i I 



^(l-z)-2+i ^ Bk-i^.d ^ \ ■'(z) + 0(11-^1-2 + 2-2^). (3 3 5) 

(b) If fca ^ then Lemma [V . 1 . 21 can be used once again to express (18.3. 411 in terms of generalized 
polylogarithms, whence an argument similar to that at the end of the preceding paragraph yields 

fc2-i _ii 3_ 

that the contributions from such terms isO(|l — zj 2 '^)j which isO(|l — z| 2 + 2 '^). The 

case when fci > 2 and fc2 = 1 is handled symmetrically. 

When fci = fc2 = 1 then {z/A)Mi^^ {z)M}^^{z) isO(|l — z|-^) and when one takes the Hadamard 
product of this term with Lio_fc3(z) the contribution will be 0(|1 — z\~'^'^). 

Now consider the case when fci = and k2 > 2. Now Mq{z) — CAT(z/4), so that 

1 k -1 L'^'^/^J k 

By Lemma f? . 1 . 21 this can be expressed as a linear combination of 

^ Lfe2/2J 
2 +^'^ ) j=0 

with a 0(|1 — z\ 2 + '^) remainder. When we take the Hadamard product of such a term with 
hiQ^ksiz) we will get a linear combination, call it S{z), of 

Li fe2-i^, (z) \ 
— + l,J+k3 j .^^ 

with a remainder of 0(|1 - z\~^'^^~^''), which is 0(|1 - z|-2 + 2-2') unless fc2 = fc - 1- When 

fc2 = fc — 1, by Lemma |7 . 1 . 21 the constant multiplying the lead-order term Li k~i {z) in 

-■2+2'L^~J 

{ — _|_2 [_ J } 

^(z) is '^''2^ " Mo ^ ^ . When we take the Hadamard product of this term with Lio,fe3(z) 
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we get a lead-order term of 

Now we use Lemma IV .1.1 1 and the observation that x'j^'^^ fj.'l^'^^ = 1 to conclude that the contri- 
bution to Rk{z) from the term with fci = and fc2 = fc — 1 is 

i?,,,LLVJ+W(,) + 0(|l_,|-|+i-), (8.3.7) 

with Dk.o — Ck-ifi- Notice that the lead order from this contribution is precisely that 
from (|8.3.5|1 but with opposite sign; thus the two contributions cancel each other to lead order. 
The case /c2 = and fci > 2 is handled symmetrically. 

The last two cases are ki — 0, k2 — I (or vice-versa) and ki = k2 = 0. The contribution 

from these cases can be easily seen tobeO(|l — z| 2"^2 ). 

We can now deduce the asymptotic behavior of Rk{z). The three contributions are H8.3.3|l . 

k 

H8.3.5I) . and (|8.3.7|l . with only (|8.3.3|l contributing in net a term of the form (1-z)" 2 +iLLfe/2j (2) 
when k is even. The coefficient of this term when k is even is given by 



k 



- y . 

4 ^ \ki 

0<ki<k ^ 
ki even 



Cki,oCk2,0- 



Finally we can sum up the rest of the contribution, define Ckj appropriately and use H7.3.1|) to 
claim the result. □ 



8.4 A central limit theorem 

Proposition 18. 3. ll and singularity analysis f Theorem I4.1.2|) allows us to get the asymptotics of 
the moments of the "approximately centered" shape functional. Using arguments identical to 
those in Section 17^ it is clear that for fc > 2 

Mn(fc) = ^^n'^/^[logn]L^-/2j +o(,fe/2[i,g„]L'c/2j-i). 
This and the asymptotics of the mean derived in Section l8. II give us, for fc > 1, 



E 



V n log n 



2k 



C2fe,0V^ 



E 



\/ n log n 



2k-l 



r(fc-i)' 

as n — > cxD. The recurrence for 6*2^.0 can be solved easily to yield, for fc > 1 

^ (2fc)!(2fc-2)! 2fc 
^2fe,o 2'=22'=-2fc!(fc-l)! ' 

where ti^ := 8(1 - log 2). Then using the identity 

FT 



0(1) 



r(fc 



-)2fe-2 



(fc-1)! 
(2fc-2)! 
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we get 



-a 



r(fc-i) 2^k\ 

It is clear now that both the "approximately centered" and the normalized shape functional are 
asymptotically normal. 

Theorem 8.4.1. Let denote the shape Junctional, induced by the toll sequence (logn)„>i, 
for Catalan trees. Then 

> A/ (0,(7) and ^ TV (0,1), 

y/n log n V Var A„ 

where 

n— 1 ^ 

a^d CT^ := 8(1 -log 2). 



8.5 Sufficient conditions for asymptotic normality 

In this speculative final section we briefly examine the behavior of a general additive functional 
Xn induced by a given "small" toll sequence (t„). We have seen evidence [Remark 17 .5.31(311 ] 
that if (i„) is the "large" toll sequence for any fixed a > 0, then the limiting behavior is 
non-normal. When t„ = logn (or i„ = and a | 0), the (limiting) random variable is normal. 
Where is the interface between normal and non-normal asymptotics? We have carried out 
arguments similar to those leading to Propositions 17. 5 . 5l and 17. 5 .Bl fsee also [i^) that suggest a 
sufficient condition for asymptotic normality, but our "proof" is somewhat heuristic, and further 
technical conditions on (t„) may be required. Nevertheless, to inspire further work, we present 
our preliminary indications. 

We assume that tn = t(n), where t{-) is a function of a nonnegative real argument. Suppose 
that x~^^'^t{x) is (ultimately) nonincreasing and that xt' {x) is slowly varying at infinity. Then 



where 

Furthermore, 
and 



EX„ = Co(n + 1) - (1 + o{l))2^n'^/'^t'{n), 



Co - Xl*"§r' 

n=l 



Var X„ - 8(1 - log 2) [nt' {n)]^n log n, 
X„ - Coin + 1) c _^(o^^2)^ ^j^^^.^ ^2 ^ _ i^g2). 



nt'{n)y/n \ogn 

This asymptotic normality can also be stated in the form 

V Var Xn 
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Appendix A 



Solution of an Euler differential equation 

We now provide the proof of Theorem l2.3.2l which states the general solution of the differential 
equation (|2.3.4|l with initial conditions = 6j, < j < m — 2. This linear differential equation 
can be written in the form 

Lg = h (A.l) 

where the operator L is defined as 

(L9)(z) := - m!(l - z)-^"^-^^ g(z). (A.2) 

We seek the solution g = A corresponding to input h = 

Equations of the form (|A.l|l - ljA.2|) are members of a class known as Euler differential equa- 
tions. In this appendix we discuss a general method for solving Euler equations, restricting 
attention, for the sake of definiteness and practicality, to (jA.l|) - ljA.2p . We assume that the 
reader is familiar with the theory of linear differential equations with constant coefficients (see, 
e.g., i). 



The homogeneous solution 



The technique for solving Lg = is quite easily summarized: make the change of variable z = 
1 — e^^, that is, x — ln((l — z)~^). For notational convenience we will abbreviate In ((1 — z)^^) 
as L{z). Then consider the function g defined by 



5(x) := .g(l - e i.e., 5(2) = .g(L(z)). 

Lemma A.l. The derivatives of g are related to those of g by 



9 



W(z) = (i-zr'=$: 



3=0 



~9^^\L{z)), 



(A.3) 



(A.4) 



where [^] denotes a signless Stirling number of the first kind. 

Proof. Since [[]] =1, the basis of the induction is ljA.3|l . For the induction step we differentiate 
the induction hypothesis (|A.4p to get 



/'^-+i)(z) = (l-z)-('=+i) 

= (l-z)-(^+i)<jfc 

k , 



kY^ g(^\Liz))+Y: 
j=o L-^-l j=o 

k 




J. 

~9^'\m) 



~g^^+'\L{z)) 



k 



g(^)(L(z)) 



;(fc+i) 



(Liz)) 
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We now use standard identities for the Stirling numbers [43, §1.2.6]: 



k + l 




, fc > 0, 



'k' 


= 1 = 




- 1" 


k 


fcH 


- 1 



fc > 0, 



and 



Thus 



"fc" 


+ 


" fc ■ 




"fc + 1" 


J. 




J - 1 




. 3 . 



1 < i < fc. 



completing the induction. 



fc + 1 

i 



□ 



The left-hand side of (fO]) can hence be expressed as 

.9W(L(z))-m!g(i(z)) 



(Lg)(z) = (l-z)-('"-i)<j ^ 



m — 1 
3 , 



so that solving Lg = is equivalent to solving Lg = 0, where 

m— 1 



Lg(x) 



E 

J=0 



m — 1 



(A.5) 



But this is a linear differential equation with constant coefficients. Its indicial polynomial, or 
characteristic polynomial, is 



V'™(A) = V(A) := J2 

3=0 



m — 1 
. 3 . 



- ml = A™-i - ml, (A.6) 

the last equality following from (43, 1.2.6-(44)]. This polynomial is discussed in some detail 
in Chapter 31 and |5ll |: some useful identities involving it arc stated in Appendix From 
the discussion in [49| it follows that there are m — 1 distinct (and nonzero) roots of ip, call them 
Ai, . . . , Xm-i arranged in nonincreasing order of real parts. Thus the functions exp (Xjx) are 
m — 1 linearly independent solutions of (|A.5|I and hence the functions (1 — z)^'^^ form a basis 
of solutions to Lg = 0. 

A particular solution 

We shall find it convenient to continue working in the x-domain of IjA.Sp . There the differential 
equation equivalent to hg = h is ~ h where 



h{x) 



h{l~e-''), i.e., h{z) = {1 - z) 



-(m-l) 



KLiz)). 



(A.7) 



From the elementary theory of linear differential equations (see, e.g., 0, §5.5]), a particular 
solution gp{x) to hg = /i in an interval near a: = is given by 



, , , , r h{t)WAt) , 

Jt=o w{t) 
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where j/i, . . . , j/m-i form any basis of solutions to Lg = 0, W{x) is the Wronskian 



det 



and Wj [x) is the determinant obtained from W(x) by replacing the jth column by the column 
Gm-i := (0,0, ... ,0, 1)-^. By Cramer's rule, the column vector (VFj(a;)/VF(a;))j=i^...^m_i is the 
unique solution a to yV(a;)a — em_i, i.e.. 



(A., 



where 'Wix) :— (y'f {x)j . The general theory also shows that the initial conditions 

V ■' / i j = l,...,m— 1 

for gp all vanish, i.e., 

g(^) (0) = 0, < fc < m - 2. (A.9) 



Here yj{x) — exp (Xjx), so that j/j* ^H^) = A* ^ exp (Xjx) for i, j — 1, . . . ,ni — 1. It follows 



that 
where 



j — m— 1 ! 



V := := (A}-i),,,=i^.. ..,„_! (A.IO) 

is the Vandermonde matrix and V^^ (w*-')ij=i,...,m-i is its inverse. The inverse is well known. 
By Exercise 1.2.3-40 in 43] (but note slight differences in definitions). 



' / l<k<m~l:k^i 



[A- 



A- AW^'(AO 



[see (p:2|l and (iRijl ] 



1 



A.^'(A,) 
1 

"a,^'(ao 
1 



V fc=i 

f-m!A-(-^'+x:^ 



m — 1 
k 



am k:a-a' 



fc=i 



m — 1 



TO — 1 






k 


~ to! 







In particular, a — (ai)i=i,..._m-i of IjA.Sp satisfies 

1 



m — 1 r 



A^^'(A.) 15 



1 



TO — 1 

k 



(A.ll) 



Ar"V(A,)'"' " i^'iK 
We have thus established 
Lemma A. 2. T/ie particular solution to Jjg = h with vanishing initial conditions (through order 
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m — 2) is 



7n—l \ .J. 



dt. 



If desired, this can be transformed back to the z-domain via ljA.3|) . (|A.4p . and (jA.7p and a 
change of variables in the integral: 

Corollary A. 3. The particular solution to Tig = h with vanishing initial conditions (through 
order m — 2) is 



9p 



t V''(A,) Jo 



The initial conditions 

Having computed a basis of solutions to the homogeneous equation and a particular solution to 
the inhomogeneous equation, so far we have established Theorem 12.3.21 modulo determination 
of the coefficients ci, . . . , Cm-i at lf07|) . 

The fact that the initial conditions for ijp and for gp vanish make it simple to solve hg = h 
or Jjg = h for specified initial conditions: One need only match up the initial conditions of the 
homogeneous solutions. 

In the x-domain the general complementary solution 

m— 1 

has initial conditions 



gr = E ^j^r' = E k = i,...,m-i, 

i=i j=i 

where V is the Vandermonde matrix at (|A.10() . In matrix notation, letting 

go := (g(°)(0),...,5(™-^)(0)r 



(A.12) 



and A — {Ai, . . . , Am-i)'^ , we can write (|A.12|) as go = VA. Thus to obtain a specified go we 
must choose A ~ V^^^go- Recalling (jA.lip . 



^ m— 1 /m — 1 



m — 1 
i 



(A.13) 



The general complementary solution to — h is then given by 



9ciz) = E 



(A.14) 



Finally we turn our attention back to g. According to Lemma [A. II go = S'go, where 



go := Cgf (0),...,gr-^)(0)r 
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are the initial conditions for g and S is the matrix 

S:= (skj)^ ( J 
of (signless) Stirling numbers of the first kind. As is well known [isl §1.2.6], 



5'-! := {s''^) = {-iy+'' 



.. ffc- 1 
J-1 



(A.15) 



k,j — l,...,m—l 



where { } denotes (signless) Stirling numbers of the second kind. Thus in order for the general 
complementary solution ljA.14|) to have a specified vector go of initial conditions, we must take 
A = V~^go — V^^ S^^go- We can now establish (|2.3.7|l . Observe that if gdz) is specified as a 
power series 



then .9?^(0) = klgk for fc = 0, . . . , m - 2. 

Proposition A. 4. The constants Aj at (jA.13p are given by 



A 



, m-2 Ik) 



Proof. Observe that, by l|A.ll(l and l|A.15|l 



nil .^^ V-/ ■ 1 1 

fe=0 -'J 



m— 1 m— 1 



m— 1 m— 1 



k=l 1=1 

^ m— 1 m— 1 



"'"""-(0) ^ E E ^ E 



m — 1 
i 



#-i)(0) 



m — 1 



fc=0 ^ 



fffHo). 



Hence it is enough to establish the curious identity 

m— 1 m— 1 r 



EE 

1=1 i=l 



m — 1 
i 



i—l ( -1 



Ar'(-i) 



/-I 



A 



fe+i 



(A.16) 



for = 0, . . . , m — 2. 

For this, denote the left-hand side of (|A.16p by (fixing m and j). First, since {g} = (5„_o, 
we have 

m — 1 

— I /A ■ I 

(A.17) 



/o = E 



m — 1 



\ m — 1 



TO 

X 



as claimed. For fc > 1, using the recurrence relation pa. §1.2.6] 



n — 1 



n — 1 



/c > 1 



for Stirling numbers of the second kind and the connection 'is', 1.2.6-(47)] between the two kinds 
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of Stirling numbers we see that 

fk - -^fk + -^/fc-i + 6m-i,k, k>l. (A.18) 

With initial condition (|A.17|I the recurrence (|A.18|I can be solved to give fk — m\j}^^^ for 

fc = 0, . . . , TO — 2, as desired. □ 
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Appendix B 

Identities involving the indicial polynomial 

The indicial polynomial 

^(A) = V'™(A) A~-m! (B.l) 

plays an important role in the analysis of m-aiy search trees under the random permutation 
model. We will enumerate a few useful identities involving the polynomial in this appendix. 

It is well known 49, Chapter 3] that -0™ has m — 1 distinct roots 2 = Ai, A2, . . . , Xm~i listed 
in nonincreasing order of real part. As in 49, Chapter 3] we introduce 

a max 5R(Aj); 

2<j<m— 1 

that is, a is the real part of the root having the second largest real part among all the roots of 
the indicial polynomial HB.1|I . We list some important properties of the roots of (jB.lj) stated 
in m §3.3]: 

1. The number —m is a root if and only if m is odd. All other roots of ?A(A) are simple, 
non-real roots. 

2. No two roots have the same real part unless they are mutually conjugate. (This follows 
from the strict increasingness of |(s + in \t\.) 

The indicial polynomial can be expressed as 

ni— 1 

^(A) = n - ^j)' (B-2) 



so that 

m— 1 

^'w=E n (^-^^)- (B.3) 

j=l l<fc<m-l: k^j 

If A = Xj for some j = 1, . . . ,m— 1, then from ljB.3|) we get 



V''(A,)= n (A.-Afc) (B.4) 

l<fc<m-l: k^j 



since all other terms in the sum for ip'iXj) contain a factor that is 0. 
Identity B.l. When A ^ {Ai, . . . , Am^i}, 



E 



(A-A,)Vy(A,) ^(A) 
Proof. Using (|B.2p and partial fraction expansion we get, for A ^ {Ai, . . . , Am.-i}, 



1 1 A, 



E 



m n;"i(A-A,) ;^a-a, 
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where, using (|B.4p . 



as required. 



□ 



(r) 

For r and n positive integers, let Hn denote the rth-order harmonic number 



n 



i=i 



When r = 1 we will use iJ„ := Hn^ for the usual (Ist-order) harmonic number 
Identity B.2. 



m— 1 



^ (A, + r)2V/(A,) 



[(m - l)!]2(m - 1)2 (m-l)!(m-l)2 

(-l)V!(m-2-r)! 
[(-0 



m-l _ 1T7II2 



r = -1 



< r < TO- 2 



r > TO — 1. 



Proof. Differentiate Identity |RT] with respect to A to get 

m— 1 



E 



1 



^(A-A,)V(A,) rP^iXy 



A ^ {Ai, . . . , A,„_i}. 



(B.5) 



Differentiating the defining equation ljB.l|l we get 

m-2 



m-2 



V''w=E n (A+fc)=A™-i^ 



1 



A + J 



(B.6) 



j=0 0<fc<m-2: k^j j=0 

the last equality being valid only for A ^ {0,-1, . . . , —(to — 2)}. Hence for r > — 1, 

m— 1 



E 



1 



fr{ (A, + ryi^'iXj) i^H-r) ((-r)™-i - m!)^ 
Now using HB.6|I we see that 

m-2 ^ 



^ 1 + ? 



J=0 



= (j - r) = (-1)V!(to - 2 - r)!, < r < to - 2, 

0<i<m-2: j^r 



m-2 



1 



= ^ — = (-l)'"rn^(F, - r > TO - 1, 



and the claim follows. 



□ 
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Identity B.3. For < fc < m — 3, 



m-l \k 



Proof. We take asymptotic expansions of both sides of Identitv lB.il as A — > oo in powers of 1/A 
and equate coefficients. The left-hand side yields 



-J m— 1 



1 



m — 1 



(I t — J. OO \ OO (I I. — J. 



m-l ^fe 



A 



On the other hand, the right-hand side equals 



1 



m-2 



A™-i - m! 



n(A + fc) 



7.1 



n 1 



V fe = 



V fe=0 



7 I 



Hence the coefficient of 1/A'"'+^ for A; = 0, . . . , m — 3 in the left-hand side must vanish, proving 



the claim. 
Identity B.4. 

m— 1 

E 

Proof. When A^ is a root of ip we have from (|B.6I 

m-2 



□ 



(A, - 2W{\,) 2(m!) 



gj^^ - 1 

(i/„7 - 1)2 



Also, differentiating (|B.6p once more gives us 

'm-2 

Et4 



*"(A) = A 



m — 1 



7n — 2 ^ 

'-^ x + j I " ^ (aTTF 



so that 

V''(2) = m!(i/,„-l) and i^"(2) = m![(i/„ - 1)^ - _ 
Thus, as A ^ 2, 

V'(A) = (A - 2)m!(F„, - 1) + im![(H„, - 1)^ - - 1)](A - 2)^ + 0(|A - 2|3). 



(B.7) 



(B., 
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By Identitv |B.ll for A ^ {Ai, . . . , Am-i} we have 

m— 1 



E 

i=2 



1 



(A-A,)^'(A,) 
1 



1 



iPiX) (A-2)m!(77™-l) 
1 



{X ~ 2)m\{Hm - I) 
1 

(A-2)m!(i/„-l) 
1 



1 

^+2 



{Hm ~ 1) 



rr(2) _ 1 



H -1 



(A-2)+0(|A-2n 



- 1 



{Hm - 1) 



rr(2) _ . 
i/rT,, — 1 



(A-2) + 0(|A-2n 



2(m! 



1 



- 1 



+ 0(|A-2|); 



in particular, letting A ^ 2 yields 

m— 1 



E 



(A, -2)V'(A,) 2(m!) 



ff(2) _ 1 
iirn J- 



as claimed. 



□ 



For the next two identities we assume that m is odd. Note that when m is odd, —m is a 
root of "0™ — in fact, A,„_i = — m, as noted at the beginning of this appendix. 

Identity B.5. When m is odd, 



E 



1 



(A, +m)^'(A,) 2(m!) 

Proof. By Identity ^Q] the left-hand side above equals 
1 1 " 



1 



7^(2) _ , 
llnr J- 



- 1)2 



— lim 



'(/'(A) (A + m)V''(— rn) 



lim 



1 



(A + m)ij'{-m) + i(A + ■m)'^ijj"{-m) + 0(|A + to|3) (A + m)tp'{-m) 



= lim 



1 



A^-m (A + 'm)'il}' (—m) 
Now by (|R6|) . 



i(A + m)&^ + 0(|A + mn 
2 ip'(—m) 



V''(-m) = -m!(i/„,- 1), 



1 V;"(-m) 

2 [i/;'(-m)]2 



(B.9) 



and by iB77|) 

?/;"(— m) = m! 



E-^ 



J=0 



]-m 



m—2 ^ 

y t 

(7 — m)2 



i![(iJ„,-l)2-(i/(?)-l)], (B.IO) 



and the result follows. 



□ 
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Identity B.6. When m is odd 

m-2 



E 



1 



1 



(A,+m)V(A,) m! 

Proof. By (|B.5|) the left-hand-side above equals 
■V'(A) 1 



12^''" 3 - 1)2 + 4 - 1)3 



lim 

A— m 



V'2(A) (A + m)V(-m) 



= lim 

A— *— m 



V''(-to) + ^"{-m){X + m) + ^Tp'"{-m){X + + 0(|A + mp) 
[V''(-m)(A -f m) -f iV"(-w)(A m)2 + i^///(_to)(A m)3 0(|A -h m)4|]2 



(A + to)2-0'(— to) 



lim 

A — ^—77 



1 



(A + to)2i/i'(— to) 

1 + + + i^fe?(A + + 0(|A + TO|3) 

[1 + + -) + ^^Sfe=?(A + m)2 + 0(|A + to|3)] = 



lim 

A — > —rn 



2 i/i'(-m) 
1 



(A + •m)'^i]j' {—m) 



1 



+ + ^) + |^7(^(A + m)2 + 0(|A + to|3) 

1 + ^(A + -) + [\{^^? + ^fe?] (A + mr + 0(|A + TO|3) 



1 ip"'{~m) 



- 1 



lim 

A — > —77 



1 



(A + to)2?/;'(— to) 



+ |;f^(A + m) + i^^(A + mr + 0(|A + toH 



1 7/-"'(-777) 



V/'(-to),, 
1---TT f(A + m) + 



ip'^—m) 



i (iP"{-m)y lV"'(-m) 



= lim 

A^-i 



4 \^'(— to) / 3 %lj'{~m) 
( ( 1 V'"'(-m) 1 fiP"(-m) 



(A + TO)2 + 0(|A + TOp) 



- 1 



(A -|- to)2'(/;'(— to) \ \ 6 ip'{—m) 4 m) 
1 V"'(-m) 1 [V'"(-m)]2 



(A + to)2 + 0(|A + to|3) 



6 [V''(-m)]2 4 [V''(-m)]3 ' 
We have already computed tp'{—m) and ip" {—m) at HB.9|I and HB.lOjl . respectively. To calculate 
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ip"'{-rn) we differentiate ((Bj)l to get, for A ^ {0, -1, ... , -(to - 2)}, 



m-2 



^ ^ (A + j)3 ^ I ^ 



L j=0 

m-2 



E 



\ + i ] I ^ (A + ?)2 



'm-2 



= A"^ 



771 — 2 I ni — 2 ^ \ / 771 — 2 



rjj-2 



In particular, since —to is a root of tp when to is odd, 

r'hm) = TO![-2(i/(?' - 1) + 3(H,„ - - 1) - - 1)3]. 



Therefore 

m-2 



E 



1 TO![-2(i7i'^ - 1) + 3(i/„ - - 1) - {H„, - 1)3] 



(2) 



^(A,+to)V(A,) 6 



(to!)2(7?„-1)2 

1 - If - - 1)]^ 
4 TO!(i7„ - 1)3 

3(i/i'' - 1) 2(i/(?) _ 1) 



6(771! 



H^m - 1 [Hm - 1)^ 



4(to!) 



{Hm - 1) 



2(i/i''-l) , (i/r-1) 



(2) 



i7„ - 1 (i/„, - 1)3 

1 i/(?^-l , l(i/l')-l)2 



12^^" 3(iJ,„-l)2 ' 4 (i/,„-l)3 



as claimed. 



□ 



106 



Bibliography 



[I] M. Abramowitz and I. A. Stcgun. Handbook of mathematical functions with formulas, 
graphs, and mathematical tables, volume 55 of National Bureau of Standards Applied Math- 
ematics Series. For sale by the Superintendent of Documents, U.S. Government Printing 
OfHce, Washington, D.C., 1964. 

[2] L. Alonso, P. Chassaing, F. Gillct, S. Janson, E. M. Reingold, and R. Schott. 
Sorting with unreliable comparisons: a probabilistic analysis. Preprint. Available at 
http : //www. math. uu. se/~svante/papers/, 2003. 

[3] P. Billingsley. Probability and measure. John Wiley & Sons Inc., New York, third edition, 
1995. A Wiley-Interscience Publication. 

[4] N. H. Bingham, C. M. Goldie, and J. L. Teugels. Regular variation, volume 27 of En- 
cyclopedia of Mathematics and its Applications. Cambridge University Press, Cambridge, 
1989. 

[5] N. Bleistein and R. A. Handelsman. Asymptotic expansions of integrals. Dover Publications 
Inc., New York, second edition, 1986. 

[6] W. E. Boyce and R. C. DiPrima. Elementary Differential Equations. John Wiley & Sons, 
4th edition, 1986. 

[7] B. Chauvin, M. Drmota, and J. Jabbour-Hattab. The profile of binary search trees. Ann. 
Appl. Probah., 11(4):1042-1062, 2001. 

[8] H.-H. Chern and H.-K. Hwang. Phase changes in random m-ary search trees and generalized 
quicksort. Random Structures Algorithms, 19(3-4):316-358, 2001. Analysis of algorithms 
(Krynica Morska, 2000). 

[9] K. L. Chung. A course in probability theory. Academic Press [A subsidiary of Harcourt 
Brace Jovanovich, Publishers], New York-London, second edition, 1974. Probability and 
Mathematical Statistics, Vol. 21. 

[10] M. Cramer. A note concerning the limit distribution of the quicksort algorithm. RAIRO 
Inform. Theor. Appl, 30(3):195-207, 1996. 

[II] R. De Prisco and A. Do Santis. New lower bounds on the cost of binary search trees. 
Theoret. Comput. Sci., 156(l-2):315-325, 1996. 

[12] L. Devroye. A note on the height of binary search trees. J. Assoc. Comput. Mach., 
33(3):489-498, 1986. 

[13] L. Devroye. On the richness of the collection of subtrees in random binary search trees. 
Inform. Process. Lett., 65(4):195-199, 1998. 

[14] L. Devroye and B. Reed. On the variance of the height of random binary search trees. 
SIAM J. Comput, 24(6):1157-1162, 1995. 

[15] R. P. Dobrow and J. A. Fill. On the Markov chain for the move-to-root rule for binary 
search trees. Ann. Appl. Probab., 5(1):1-19, 1995. 



107 



[16] R. P. Dobrow and J. A. Fill. Rates of convergence for the move-to-root Markov chain for 
binary search trees. Ann. Appl. Probab., 5(l):20-36, 1995. 

[17] R. P. Dobrow and J. A. Fill. Multiway trees of maximum and minimum probability under 
the random permutation model. Combin. Probab. Comput., 5(4):351-371, 1996. 

[18] M. Drmota. An analytic approach to the height of binary search trees. Algorithmica, 
29(1-2) :89-l 19, 2001. Average-case analysis of algorithms (Princeton, NJ, 1998). 

[19] M. Drmota. The variance of the height of binary search trees. Theoret. Comput. Sci., 
270(1-2):913-919, 2002. 

[20] W. Feller. An introduction to probability theory and its applications. Vol. II. Second edition. 
John Wiley & Sons Inc., New York, 1971. 

[21] J. A. Fill. On the distribution of binary search trees under the random permutation model. 

Random Structures Algorithms, 8(l):l-25, 1996. 

[22] J. A. Fill and R. P. Dobrow. The number of m-ary search trees on n keys. Combin. Probab. 
Comput., 6(4):435-453, 1997. 

[23] J. A. Fill, P. Flajolet, and N. Kapur. Singularity analysis, Hadamard products, and tree 
recurrences, 2003. In preparation. 

[24] J. A. Fill and S. Janson. A characterization of the set of fixed points of the Quicksort 
transformation. Electron. Comm. Probab., 5:77-84 (electronic), 2000. 

[25] J. A. Fill and S. Janson. Smoothness and decay properties of the limiting Quicksort density 
function. In Mathematics and computer science (Versailles, 2000), Trends Math., pages 
53-64. Birkhauser, Basel, 2000. 

[26] J. A. Fill and S. Janson. Approximating the limiting Quicksort distribution. Random Struc- 
tures Algorithms, 19(3-4):376-406, 2001, arXiv:math.PR/0105246. Analysis of algorithms 

(Krynica Morska, 2000). 

[27] J. A. Fill and S. Janson. Quicksort asymptotics. J. Algorithms, 44(l):4-28, 2002, 
arXiv:niath. PR/0105248. Analysis of algorithms. 

[28] P. Flajolet. Singularity analysis and asymptotics of Bernoulli sums. Theoret. Comput. Sci., 
215(1-2):371-381, 1999. 

[29] P. Flajolet, X. Gourdon, and C. Martinez. Patterns in random binary search trees. Random 
Structures Algorithms, ll(3):223-244, 1997. 

[30] P. Flajolet and G. Louchard. Analytic variations on the Airy distribution. Algorithmica, 
31(3):361-377, 2001. Mathematical analysis of algorithms. 

[31] P. Flajolet and A. Odlyzko. Singularity analysis of generating functions. SIAM J. Discrete 
Math., 3(2):216 240, 1990. 

[32] P. Flajolet and J.-M. Steyaert. A complexity calculus for recursive tree algorithms. Math. 
Systems Theory, 19(4):301-331, 1987. 

[33] M. Foley and C. A. R. Hoare. Proof of a recursive program: quicksort. Comput. J., 
14:391-395, 1971. 

[34] N. Proman and P. O. Proman. JWKB approximation. Contributions to the theory. North- 
Holland Publishing Co., Amsterdam, 1965. 



108 



[35] P. Hennequin. Combinatorial analysis of quicksort algorithm. RAIRO Inform. Theor. 
Appl, 23(3):317-333, 1989. 

[36] P. Hennequin. Analyse en moyenne d'algorithme, tri rapide et arbres de recherche. PhD 
thesis, Ecole Polytechnique, 1991. 

[37] P. Henrici. Applied and computational complex analysis. Vol. 2. Wiley Interscience [John 
Wiley & Sons], New York, 1977. Special functions — integral transforms — asymptotics — 
continued fractions. 

[38] H.-K. Hwang and R. Neininger. Phase change of limit laws in the quicksort recurrence 
under varying toll functions. SI AM J. Comput., 31(6):1687-1722, 2002. 

[39] J. Jabbour-Hattab. Martingales and large deviations for binary search trees. Random 
Structures Algorithms, 19(2):112-127, 2001. 

[40] S. Janson. Ideals in a forest, one-way infinite binary trees and the contraction method. 
In Mathematics and computer science, II (Versailles, 2002), Trends Math., pages 393-414. 
Birkhiiuser, Basel, 2002. 

[41] S. Janson. The Wiener index of simply generated random trees. Preprint. Available at 
http : //www . math . uu . se/~svante/papers/, 2002. 

[42] C. Knessl and W. Szpankowski. Quicksort algorithm again revisited. Discrete Math. Theor. 
Comput. Sci., 3(2):43-63 (electronic), 1999. 

[43] D. E. Knuth. The art of computer programming. Volume 1. Addison- Wesley Publishing 
Co., Reading, Mass. -London-Don Mills, Ont., 3rd edition, 1997. 

[44] D. E. Knuth. The art of computer programming. Volume 3. Addison- Wesley Publishing 
Co., Reading, Mass. -London-Don Mills, Ont., 2nd edition, 1998. 

[45] D. E. Knuth and B. Pittel. A recurrence related to trees. Proc. Amer. Math. Soc, 
105(2):335-349, 1989. 

[46] D. E. Knuth and A. Schonhage. The expected linearity of a simple equivalence algorithm. 
Theoret. Comput. Sci., 6(3):281-315, 1978. 

[47] Y. Koda and F. Ruskey. A Gray code for the ideals of a forest poset. J. Algorithms, 
15(2):324-340, 1993. 

[48] W. Lew and H. M. Mahmoud. The joint distribution of elastic buckets in multiway search 
trees. SIAM J. Comput, 23(5):1050-1074, 1994. 

[49] H. M. Mahmoud. Evolution of random search trees. John Wiley & Sons Inc., New York, 
1992. A Wiley-Interscience Publication. 

[50] H. M. Mahmoud and B. Pittel. Analysis of the space of search trees under the random 
insertion algorithm. J. Algorithms, 10(l):52-75, 1989. 

[51] H. M. Mahmoud and R. T. Smythe. Probabilistic analysis of bucket recursive trees. The- 
oret. Comput. Sci., 144(1-2) :221-249, 1995. Special volume on mathematical analysis of 
algorithms. 

[52] A. W. Marshall and I. Olkin. Inequalities: theory of majorization and its applications. 
Academic Press Inc. [Harcourt Brace Jovanovich Publishers], New York, 1979. 

[53] C. Martinez and S. Roura. Randomized binary search trees. J. ACM, 45(2):288-323, 1998. 



109 



[54] A. Mcir and J. W. Moon. On the altitude of nodes in random trees. Canad. J. Math., 

30(5):997-1015, 1978. 

[55] A. Meir and J. W. Moon. On the log-product of the subtree-sizes of random trees. Random 
Structures Algorithms, 12(2):197-212, 1998. 

[56] R. R. Muntz and R. C. Uzgahs. Dynamic storage allocation for binary search trees in a 
two-level memory. In Proceedings of Princeton Conference on Information Sciences and 
Systems, volume 4, pages 345-349, 1971. 

[57] R. Neiningcr. On binary search tree recursions with monomials as toll functions. J. Comput. 
Appl. Math., 142(1):185-196, 2002. Probabilistic methods in combinatorics and combina- 
torial optimization. 

[58] A. Panholzer and H. Prodinger. An analytic approach for the analysis of rotations in 
fringe-balanced binary search trees. Ann. Comb., 2(2):173-184, 1998. 

[59] A. Panholzer and H. Prodinger. Binary search tree recursions with harmonic toll functions. 
J. Comput. Appl. Math., 142(l):211-225, 2002. Probabilistic methods in combinatorics and 

combinatorial optimization. 

[60] M. Regnier. A limiting distribution for quicksort. RAIRO Inform. Theor. Appl, 23(3):335- 
343, 1989. 

[61] J. M. Robson. On the concentration of the height of binary search trees. In Automata, 
languages and programming (Bologna, 1997), volume 1256 of Lecture Notes in Comput. 
Sci, pages 441-448. Springer, Berlin, 1997. 

[62] J. M. Robson. Constant bounds on the moments of the height of binary search trees. 

Theoret. Comput. Sci., 276(l-2):435 444, 2002. 

[63] U. Rosier. A limit theorem for "Quicksort". RAIRO Inform. Theor. Appl, 25(1):85-100, 
1991. 

[64] U. Rosier. On the analysis of stochastic divide and conquer algorithms. Algorithmica, 
29(l-2):238-261, 2001. Average-case analysis of algorithms (Princeton, NJ, 1998). 

[65] U. Rosier and L. Riischcndorf. The contraction method for recursive algorithms. Algorith- 
mica, 29(l-2):3-33, 2001. Average-case analysis of algorithms (Princeton, NJ, 1998). 

[66] W. Schachingcr. Asymptotic normality of recursive algorithms via martingale difference 
arrays. Discrete Math. Theor. Comput. Sci., 4(2):363-397 (electronic), 2001. 

[67] R. Sedgewick. Quicksort with equal keys. SIAM J. Comput., 6(2):240 267, 1977. 

[68] R. P. Stanley. Enum,erative combinatorics. Vol. 2. Cambridge University Press, Cambridge, 
1999. With a foreword by Gian-Carlo Rota and appendix 1 by Sergey Fomin. 

[69] J.-M. Steyaert and P. Flajolet. Patterns and pattern-matching in trees: an analysis. Inform, 
and Control, 58(l-3):19-58, 1983. 

[70] L. Takacs. A Bernoulli excursion and its various applications. Adv. in Appl. Probab., 
23(3):557-585, 1991. 

[71] L. Takacs. Conditional limit theorems for branching processes. J. Appl. Math. Stochastic 
Anal., 4(4):263-292, 1991. 



110 



[72] L. Takacs. On a probability problem, connected with railway traffic. J. Appl. Math. Stochas- 
tic Anal, 4(l):l-27, 1991. 

[73] K. H. Tan and P. Hadjicostas. Some properties of a limiting distribution in Quicksort. 
Statist. Probab. Lett, 25(l):87-94, 1995. 



Ill 



Vita 



Nevin Kapur was born on May 9, 1974 in Munibai (formerly Bombay), India. 

He received a Bachelor of Technology in Electrical Engineering, with a specialization in 
VLSI design and automation, from the Indian Institute of Technology, Bombay in 1996. Prom 
1996 to 1998 he was part of the Collaborative Benchmarking Laboratory at North Carolina 
State University (NCSU). He received a Master of Science from the Electrical and Computer 
Engineering department, specializing in graph algorithms and their applications to automatic 
place-and-route in VLSI design. At NCSU, while taking courses in design and analysis of 
algorithms, graph theory, and probability, he became interested in these topics and their in- 
tersection. This motivated his joining the Department of Mathematical Sciences at The Johns 
Hopkins University in 1998. 

Nevin plans on pursuing an academic career, as a researcher pursuing problems in discrete 
mathematics, probability, and their applications to computer science. 

Baltimore, 2003 



112 



